EDITORIAL


Spain's good news 


Last month finally brought good news for science
in Spain. Newly elected prime minister Pedro San- 
chez named Spanish astronaut and aeronautical 
engineer Pedro Duque as head of a new Ministry 
for Science, Innovation, and Universities. This will 
hopefully put Spanish research and development 
(R&D) back on the country’s political agenda after 
two legislatures in which science languished under the 
Ministry of Economy. Given the minority position of the 
ruling party in both chambers 
of Congress, Duque will face 
extraordinary challenges to 
increase the budget. But the 
creation of this new ministry 
is an encouraging first step. 

Over the past 7 years, Spain 
has suffered from insufficient 
R&D spending, setting its 
entire scientific enterprise 
back by two decades. In 2016, 
R&D represented only 1.2% 
of the gross domestic prod- 
uct (GDP), compared to 2% 
for the European Union; half 
of this was accounted for by 
the private sector. During this 
decline, most public research 
institutions were left nearly 
bankrupt [including the Span- 
ish National Research Council 
(with 139 centers), the Span- 
ish Institute of Oceanography, 
the Carlos III Health Insti-
tute, and the Research Cen-
tre for Energy, Environment
and Technology (CIEMAT)]. Research projects became 
paralyzed by new bureaucratic processes; scientific talent 
thinned out dramatically; employment declined to pre- 
carious levels (plagued by irregular short-term contracts); 
and the average age of tenured academics increased to 53 
years. From 2010 to 2014, Spain lost 27,358 R&D jobs— 
had funding been maintained at 2009 levels, about 61,940 
such jobs would have been created. 

Sadly, the previous Spanish government's vision of how 
R&D can change the economy was seriously flawed— 
it underestimated the importance of basic research, 
oversimplified knowledge transfer, minimized the time 
needed to bring ideas to market, and ignored how pub- 
lic investment attracts private investment. The new San- 
chez government must move away from this ill-conceived 
model. Spain’s prosperity will only come about through
innovations based on the generation of new knowledge.
Relying on the private sector to achieve the 2020 target of
3% GDP investment in R&D (proposed by the European
Union in 2000) while drastically reducing public invest-
ment is unrealistic.

Madrid Innovation Campus, Villaverde district, Madrid, Spain.

“Spain’s prosperity will only come about through innovations based on...new knowledge.”

The creation of Spain’s new Ministry for Science, In- 
novation, and Universities acknowledges the important 
role of basic research and the synergy that should exist 
between research and higher education in the sciences. 
It should be further supported 
by the inclusion of universi- 
ties in Spain’s Science, Tech- 
nology and Innovation Law 
(to strengthen research at a 
national level) and by con- 
crete measures to encourage 
mobility between research 
and educational institutions. 
The latter should include 
addressing the role of the 
National Agency for Quality 
Assessment and Accreditation 
in feeding academic inbreed- 
ing in Spanish universities (af- 
fecting 70% of professors; less 
than 2% are foreigners). 

The new government cer- 
tainly needs to invest more 
in R&D, but there are urgent 
measures that can be imple- 
mented quickly that require 
more political will than fund- 
ing. These include ending bu- 
reaucratic processes imposed 
by the former Ministry of Fi- 
nance on all research activities (contracts, procurements) 
that have paralyzed the system; spending all appropriated 
R&D funds (only 50% has been spent so far); ending the 
year-long delays on announcing calls for research; ending 
limitations on continually renewing contracts (unavoid- 
able with the current 3-year grant system); exempting 
research-related contracts from lengthy competitive 
bidding processes; and ending the Prior Intervention 
Requirement that prevents research institutions from 
executing their budgets (some institutions have reported 
50% losses). These practices were intended to sequester 
R&D funds to reduce spending. 

The previous government compromised Spain’s future 
by creating a deficit in innovation and discovery. Spain 
can no longer be trapped in this failed economic model. 

- Amaya Moro-Martin
“A repudiation of established research ... in what appears
to be an effort to curry favor with commercial interests.”


Public health scholars in a letter to the Trump administration, criticizing its attempts to alter 
a World Health Assembly resolution supporting breastfeeding to favor baby formula. 
CONSERVATION


Dams may threaten endangered apes 


Flooding could jeopardize 1500 chimpanzees in a conservation preserve in Guinea.


Conservationists are protesting proposals to build two hydro-
electric dams, one in West Africa and another in Indonesia,
because both could reduce habitat for critically endangered
nonhuman primates—western chimpanzees and Tapanuli
orangutans, respectively. In Guinea, a management board was
finalizing plans this week to construct a 294-megawatt facility
in Moyen-Bafing National Park that wildlife experts say could kill up 
to 1500 western chimpanzees (Pan troglodytes verus), a subspecies 
whose population has fallen by more than 80% over the past 20 years. 
Flooding from the dam and associated human activity will force some 
of them to move, primatologists working with the International Union 
for Conservation of Nature report. Because chimpanzees are territo- 
rial, notes one of the scientists, Rebecca Kormos of the University of 
California, Berkeley, deadly warfare will likely break out when one 
group is pushed onto another’s turf. Kormos and colleagues have 
urged Guinea’s government to look for alternative sites for the dam or 
alternative energy sources. A dam planned for Sumatra in Indonesia 
faces similar criticism; the Tapanuli orangutan species (Pongo
tapanuliensis), discovered only last year, has a remaining population 
of just 800 apes. 
Wellcome seeks long shots


RESEARCH FUNDING | The world’s 
second biggest private funder of medi- 
cal research, the London-based charity 
Wellcome Trust, announced this week 
that it will provide £250 million for 
unconventional research that could 
transform science or health. The money 
is earmarked for ideas shut out from 
traditional funding because they are less 
likely to succeed, and for projects with 
researchers who lack a background in 
the life sciences but bring other exper- 
tise. Called the Wellcome Leap Fund,
it will be set up as a subsidiary of the
Wellcome Trust with an independent 
board; the search for a CEO will start 
shortly and its first research programs 
could start in late 2020. The fund, to be 
established over 5 years, will account for 
about 5% of Wellcome’s total spending 
during that period. 


Death from nerve agent probed 


CHEMICAL WEAPONS | A second incident 
involving a Novichok nerve agent in
the United Kingdom has left one victim
dead—and set abuzz the community of
scientists who study countermeasures to
chemical weapons. On 30 June, a couple
fell ill in Amesbury, U.K., about
12 kilometers north of Salisbury, where,
4 months earlier, ex-spy Sergei Skripal
and his daughter Yulia spent weeks
in intensive care after exposure to a
Novichok agent first developed by the
Soviet Union decades ago. U.K. authori-
ties say it’s too early to say how the
incidents are related. Dawn Sturgess,
44, died on 8 July; her partner, Charlie
Rowley, 45, remains critically ill.
Investigators have not revealed whether
the agent that poisoned them is the same
compound used against the Skripals and
if so, whether it is from the same batch.
Novichok compounds stored in pow-
der form are thought to be stable, but
degrade quickly when exposed outdoors. 
The notion that the same agent might 
have lingered in the environment for 
months is “incredible, and deeply dis- 
turbing,” a U.S.-based chemical defense 
researcher says. 
Iceman’s last meal analyzed


Researchers have provided a detailed chemical analysis of
the last meal eaten by Ötzi the Iceman—the 5300-year-old
man found frozen and naturally mummified in the Italian
Alps in 1991—and found it was rich in fat. As described
in previous accounts, Ötzi had a completely full stomach
when he was shot with an arrow and died. In the new study,
published 12 July in Current Biology, scientists quantified the
ancient DNA, proteins, and other chemicals preserved in his
stomach contents. Fat residues, which came from the red deer
and ibex he ate, made up a remarkably high share of the total,
about 50%, they determined; although one meal doesn’t reveal
a lifetime’s diet, a high-fat diet may have given Ötzi the energy
he needed to survive at high altitudes. The iceman’s last meal
was balanced with grains from einkorn wheat and traces of
a genus of toxic ferns called bracken. Researchers say he may
have consumed the bracken as medicine to treat parasites
previously identified in his intestines.


A high-fat diet may have helped Ötzi the Iceman survive at high altitudes.


Puerto Rico’s stats agency saved


SCIENCE POLICY | Puerto Rico has backed away from a
controversial plan to eliminate the Puerto Rican Institute of
Statistics (PRIS), the agency that vets data and statistical
analyses about the island. Governor Ricardo Rosselló had
proposed bringing PRIS under the control of the island’s
Department of Economic Development and Commerce and
ultimately outsourcing its responsibilities to the private
sector, as part of a broad reorganization plan.
Critics—including AAAS, which publishes Science—argued that
the bill would undermine PRIS’s political independence and the
trustworthiness of Puerto Rico’s statistics. On 30 June, the
island’s legislature approved the reorganization of the
economic department but preserved PRIS in its current form.
The move won praise from Lisa LaVange, president of the
American Statistical Association in Alexandria, Virginia.
“Having independent, strong, and transparent statistics will
serve the health, well-being, and economy of Puerto Rico and
its people for decades to come,” she says.


Global science body launched


SCIENTIFIC SOCIETIES | A new international body aiming to be
the “global voice for science” was launched at a meeting in
Paris last week. The inauguration of the International Science
Council (ISC) follows a decision last year to merge two
existing bodies representing the social and natural
sciences—the International Social Science Council and the
International Council for Science—to pool resources and break
down barriers between disciplines. The ISC, which will focus on
supporting research related to international issues such as
poverty and climate change, counts more than 180 organizations
as members, including the U.S. National Academy of Sciences and
The Royal Society of the United Kingdom. At the 4 July meeting,
South African mathematician Daya Reddy was elected ISC’s first
president.


Countries in which people posted 
on social media during the first 
week of July celebrating the 
first International Day of LGBTQ+ 
People in Science, Technology, 
Engineering, and Math on 5 July. 


900 million
Tons of insects consumed by birds 
per year, according to an estimate in 
a study that examined habitats 
around the world and noted the 
importance of bird populations for 
keeping insect pests in check 
(The Science of Nature). 
Ancient tools found in China


ARCHAEOLOGY | Hominins lived in China 
more than 2 million years ago, according 
to new evidence that may push back the 
earliest dates for hominins living outside 
Africa by 250,000 years. If the dates hold, 
they will boost a minority view that a key 
human ancestor, Homo erectus, could have 
arisen in Asia, not Africa. In this week’s 
Nature, researchers report that they recov- 
ered and analyzed 96 stone points, flakes, 
and cores dropped in a gully at Shangchen 
in north-central China’s windswept Loess 
Plateau. Until now, the oldest known
site for hominins outside of Africa was
Dmanisi, Georgia, where fossil hominins
and stone tools date to 1.85 million years ago.
The Shangchen site lacks volcanic miner- 
als, which provide the gold standard for 
radiometric dating methods. Instead, the 
scientists used paleomagnetic dating— 
which detects known reversals in Earth’s 
magnetic field that are recorded in ancient 
rock—to date the layers of sediment con- 
taining the stone tools. The oldest layers 
date from 2.1 million years ago. 


Church backs climate action 


CLIMATE SCIENCE | The Church of 
England will, by 2023, begin to divest from 
companies that aren’t taking enough action 
against climate change, leaders agreed this 
week. The decision follows other steps by 
the church to influence the energy indus- 
try by using its £12 billion investment 
fund, which contains about £125 million 
in shares of large oil and gas companies. 
In 2015, the church divested £12 million of 
its assets from tar sands and coal projects. 
Last year, the church teamed up with the 
London School of Economics to create the 
Transition Pathway Initiative, which has 
been evaluating efforts by companies to
help reach the Paris climate agreement
goal of keeping global warming to less than
2°C. The church originally planned to start
divesting from companies by 2020 if they
didn’t pass muster, but leaders this week
voted to delay that by 3 years to allow more
time to pressure companies to change their
ways. The climate divestment would be the
largest by any church fund.

Canterbury Cathedral is the seat of the Church of England, which is mulling divesting funds over climate change.


Carrier ends monkey flights 


ANIMAL RESEARCH | The Russian carrier 
AirBridgeCargo Airlines will no longer 
transport nonhuman primates for research 
purposes, leaving Air France as the only 
major airline still carrying the animals.
A representative of the Moscow-based
company disclosed the decision in a 3 July
email to People for the Ethical Treatment
of Animals (PETA) in Norfolk, Virginia.
In late June, PETA urged its supporters
to ask AirBridgeCargo to stop transport-
ing primates, mostly macaques, bound for 
research labs. More than 188,000 people 
emailed the airline. It had been shipping 
animals from China to Chicago O’Hare 
International Airport. Airline officials did 
not respond to emails requesting comment. 
Most primates imported to the United 
States for research originate in China; oth- 
ers arrive from Vietnam and Mauritius. In 
2014, the last year for which the U.S. Fish 
and Wildlife Service made such informa- 
tion available, 23,426 research primates 
were imported. 


New leader at EPA 


SCIENCE POLICY | Andrew Wheeler, a
former energy industry lobbyist and U.S. 
Senate aide, became acting head of the 
Environmental Protection Agency (EPA) this 
week, replacing Scott Pruitt, who resigned 
on 5 July after becoming engulfed in ethics
scandals. Wheeler, a polished Washington
insider who was confirmed as EPA deputy 
administrator in April, is expected to forge 
ahead with many of Pruitt’s initiatives, 
including efforts to roll back climate and 
wetlands protection rules. But his arrival 
likely marks the demise of one dream of 
EPA foes: undermining the so-called endan- 
germent finding, a 2009 legal opinion that 
obligates EPA to regulate carbon dioxide 
emissions. Wheeler has called the finding 
“settled law.” 


Newborn screening list amended 


PUBLIC HEALTH | Health and Human 
Services (HHS) Secretary Alex Azar last 
week added spinal muscular atrophy 
(SMA) to the list of disorders for which the 
government recommends that newborns be 
screened. States implement the screening 
and generally follow the federal govern- 
ment’s lead; with the addition of the fatal, 
inherited disease, which can be detected by 
a genetic test, HHS now suggests screen- 
ing for 35 such conditions. SMA destroys 
motor neurons in the brain stem and spi- 
nal cord. In its most common, most severe 
form, children die in the first few years of 
life. In December 2016, the Food and Drug 
Administration (FDA) approved the first 
therapy, nusinersen (Spinraza), made by 
Biogen of Cambridge, Massachusetts. In 
February, an advisory committee had voted 
8 to 5 to recommend that Azar add the dis- 
ease to the screening panel, but dissenters 
worried about the lack of long-term data 
on the drug’s effectiveness and side effects 
(Science, 29 June, p. 1385). 


Ontario head shrinks science 


SCIENCE POLICY | Researchers in Ontario, 
Canada’s most populous province, are 
concerned by a sudden shift in the govern- 
ment’s approach to science and 
evidence-based policy following the elec- 
tion of populist Doug Ford as the province’s 
premier. Since taking power on 29 June, 
Ford, the brother of the late Toronto Mayor 
Rob Ford, has canceled the province’s car- 
bon cap-and-trade system; eliminated the 
position of minister of science and innova- 
tion in his Cabinet; and fired the province’s 
chief scientist, who had been on the job 
only since November 2017. “It’s unfortunate 
to see Premier Ford shift robust evidence 
and science-based policy down his list of 
priorities,” said Kathleen Walsh, director
of policy at the science campaign group
Evidence for Democracy in Ottawa. 


Ice reveals a messenger from a blazing galaxy 


Neutrino astronomy is born as IceCube and other instruments find likely home of particle 


By Daniel Clery 


If astronomers are right, a ghostly
particle that lit up an instrumented
swathe of ice beneath the South Pole on
22 September last year was a messenger
from a distant galaxy. The particle was
a neutrino, electrically neutral and al-
most massless, which means its path could
be traced back to the extragalactic event 
that created it. Cued by IceCube, the Ant- 
arctic detector, the orbiting Fermi Gamma- 
ray Space Telescope found that the neutrino 
likely came from a far off blazar, a hugely 
bright source of radiation powered by a 
supermassive black hole. 

Astronomers have long been tantalized by the prospect of using
neutrinos, which move at almost the speed of light and rarely
interact with other matter, to learn about violent cosmic
events. The new finding, reported on pages 146 and 147, could
mark the founding event of neutrino astronomy. The detection
also triggered a powerful example of another new trend,
multimessenger astronomy, in which telescopes and other
instruments studied the flaring blazar in all parts of the
electromagnetic spectrum, from gamma rays to radio waves.

A neutrino-producing blazar could also help solve a
decades-old mystery in astronomy: Where do the extremely high
energy protons and other nuclei that occasionally bombard
Earth come from? Known as ultrahigh-energy cosmic rays, these
particles have a million times more energy than has ever been
produced in an earthbound particle accelerator, but what boosts
them to such colossal energies is unknown. Suspects have
included neutron stars, gamma-ray bursts, hypernovae, and the
radiation-spewing black holes at the center of some galaxies,
but whatever the source, high energy neutrinos are a likely
byproduct (Science, 31 July 2015, p. 465). If the IceCube team
is right, blazars could be the first confirmed source of these
cosmic rays.

Researchers note, however, that the link between the neutrino
and the blazar isn’t rock-solid. “It’s a very mouthwatering
observation and I very much hope it will be confirmed,” says
Pierre Sokolsky of the University of Utah in Salt Lake City.
“If their interpretation of those observations is correct it
will be revolutionary, extraordinary,” says Eli Waxman of the
Weizmann Institute of Science in Rehovot, Israel. But, he adds,
“an extraordinary result needs extraordinary support, and the
support is not quite extraordinary yet.”

Completed in 2010, the IceCube neutrino detector snares these
elusive particles in a cubic kilometer of Antarctic ice. When a
neutrino hits a nucleus in the frozen water molecules, other
particles fly off in recoil; as they decelerate, they emit
light called Cherenkov radiation, which some of IceCube’s 5160
light detectors may pick up. Based on the location, timing, and
brightness of the detected light, researchers can reconstruct
the path and energy of the neutrino.

Most of the neutrinos detected by IceCube originated nearby,
spawned by cosmic rays hitting Earth’s upper atmosphere.
IceCube researchers eliminate those using a variety of methods,
leaving the very few, very high energy neutrinos, above
30 trillion electron volts (TeV). In 2013, the IceCube team
first revealed a handful of such events, arguing that their
high energies and other properties showed they must have come
from outside of our galaxy (Science, 22 November 2013, p. 920).
The detector continues to bag about a dozen high energy
neutrinos a year; when it gets a clean track with a
well-defined direction, other telescopes scramble to see if
there is an obvious cosmic source—until now, without success.

A polar light show with cosmic origins

The detection, and computed path, of neutrino IceCube-170922A.
Each circle represents one of IceCube’s spherical light
detectors in the Antarctic ice: size indicates the brightness
detected, from earliest (in dark blue), to latest (in yellow).

In 2016 IceCube’s operators set up an 
alert service, with the hope of getting more 
telescopes at different wavelengths involved 
in the hunt. Then, last September, IceCube 
got lucky. A detected neutrino, dubbed 
IceCube-170922A and calculated to have an 
energy of 290 TeV, offered a relatively clear 
track back into space. An automatic alert 
went out less than a minute later. 

Several observatories initially didn’t see 
anything unusual. Six days later, the Fermi 
team reported the satellite had found that 
a blazar, known as TXS 0506+056 and just 
0.1° away from the neutrino track suggested 
by IceCube, was especially bright, having 
started flaring a few months earlier. Soon, 
more than a dozen telescopes had studied 
the blazar. Blazars, like quasars, are distant 
cosmic beacons powered by supermassive 
black holes, which generate intense radia- 
tion and fire jets of particles from their poles. 
Blazars are exceptionally bright, astrono- 
mers believe, because their jets happen to be 
aimed straight at Earth. 

IceCube and the other observers estimate 
the probability that the neutrino path and 
the blazar coincided by chance is roughly 
one in 740. Physicists and astronomers, 
however, aren’t usually convinced that two 
phenomena are connected until there’s no 
more than a one in 3.5 million, or 5-sigma, 
probability of a coincidence. 
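
For readers who want to check how "one in 740" compares with the 5-sigma threshold, the short Python sketch below converts between chance-coincidence odds and sigma levels. It assumes the usual one-sided Gaussian tail convention, which the article does not spell out, and the function names are illustrative only. Under that assumption, one in 740 works out to roughly 3 sigma, while one in 3.5 million sits at about 5 sigma.

```python
from statistics import NormalDist

# Standard normal distribution (mean 0, standard deviation 1).
_STD_NORMAL = NormalDist()


def sigma_from_odds(one_in: float) -> float:
    """Sigma level whose one-sided Gaussian tail probability is 1 in `one_in`.

    Assumption: a one-sided tail, as is common when quoting "5 sigma".
    """
    p = 1.0 / one_in
    # inv_cdf(p) is negative for small p; flip the sign to get the sigma level.
    return -_STD_NORMAL.inv_cdf(p)


def odds_from_sigma(sigma: float) -> float:
    """Return N such that the one-sided tail beyond `sigma` is 1 in N."""
    return 1.0 / _STD_NORMAL.cdf(-sigma)


if __name__ == "__main__":
    print(f"1 in 740         -> {sigma_from_odds(740):.2f} sigma")
    print(f"1 in 3.5 million -> {sigma_from_odds(3.5e6):.2f} sigma")
    print(f"5 sigma          -> 1 in {odds_from_sigma(5):,.0f}")
```

The gap between the two figures is the point: moving from about 3 sigma to 5 sigma requires the chance probability to shrink by a factor of several thousand.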

IceCube researchers also went back 
through almost a decade of data to see 
whether an excess of high-energy neutri- 
nos had streamed from the same location 
before. They found a period of 150 days in 
late 2014 and early 2015 when IceCube de- 
tected around 13 more neutrinos than nor- 
mal from that spot. It’s not yet clear whether 
TXS 0506+056 was flaring at that time, but 
“the archival event was much more inter- 
esting” than the recent detection, says Ice- 
Cube Principal Investigator Francis Halzen 
of the University of Wisconsin in Madison. 

Sokolsky and Waxman agree that Ice- 
Cube’s September 2017 detection should 
strengthen the project’s longstanding bid 
to massively increase the size of the instru- 
ment, which would also increase how many 
neutrinos it can detect and improve its 
pointing accuracy. Since IceCube was built, 
the team has found the ice is clearer than 
previously thought, so they believe they can 
make IceCube 10 times bigger while only 
doubling the number of light detectors, 
matching the $280 million cost of its origi- 
nal construction. The team is about to start 
experiments to test that. “With a 10-times 
bigger detector, the answer [to where high 
energy neutrinos come from] would be clear
and obvious,” Waxman says. 
SPACE


China announces new flotilla 
of space science missions 


Probes will watch for colliding black holes, image Earth’s 
magnetosphere, and monitor the sun 


By Dennis Normile 


China’s ambitious human space mis-
sions get most of the headlines, but
its fledgling space science program is
quietly gaining strength. The Chinese
Academy of Sciences (CAS) last week
confirmed plans to launch four new
scientific satellites beginning in 2020. Com- 
ing on the heels of four successful missions, 
including one devoted to x-ray astronomy 
and another that demonstrated quan- 
tum entanglement over a record-setting 
1200 kilometers, these “phase 2” projects 
will examine areas including solar physics 
and the hunt for electromagnetic signals as- 
sociated with gravitational waves. 

Given that China’s space science program 
only started about 10 years ago, the length- 
ening track record “is impressive, but there 
are still not many missions given that it’s a 
big country with a big science community,” 
says Xin Wu, a China-born physicist at the 
University of Geneva in Switzerland who 
collaborates on China’s astrophysics mis- 
sions. “There is pent-up demand” among 
Chinese space scientists, he says. 

CAS broke with tradition for one of 
the new missions, the Gravitational Wave 
High-energy Electromagnetic Counterpart 
All-sky Monitor (GECAM). It fast-tracked 
selection and development to take advan- 
tage of a new scientific opportunity, which 
Xiong Shaolin, an astrophysicist at CAS’s
Institute of High Energy Physics in Beijing, 
and his colleagues identified a month after 
the U.S. Laser Interferometer Gravitational- 
Wave Observatory announced its historic 
detection of gravitational waves in Febru- 
ary 2016. They proposed putting two satel- 
lites into orbit on opposite sides of Earth 
that together could watch the entire sky 
for gamma rays emanating from the events 
that generate gravitational waves. Funding 
for technical studies arrived a few months 
later, and the mission has jumped to the 
front of the launch queue, with a date of 
2020. “When you have this kind of oppor- 
tunity you can’t handle it like a normal mis- 
sion, with selection and review taking 10 or 
20 years,” Xiong says. 

So far, gamma rays and other electro- 
magnetic signals have only been detected 
from one kind of gravitational-wave source, a 
neutron star merger, but they yielded a trove 
of detail about the enigmatic event. Astro- 
physicists are still debating whether black 
hole mergers, the other confirmed source 
of gravitational waves, also produce electro- 
magnetic emissions. The GECAM team is 
betting that they do—and that much can be 
learned from the signals. “I think probably 
we will find something,” Xiong says. 

Two satellites, launching in 2020, will watch for gamma
rays from the violent birth of gravitational waves.

GECAM’s observations will complement
those of another phase 2 mission, the Ein-
stein Probe (EP), which will survey the sky
for the low energy x-rays associated with
violent phenomena such as gamma ray 
bursts and black hole collisions. Combining 
GECAM, EP, and gravitational wave obser- 
vations “will allow us to better understand 
gamma ray burst astrophysics,” says Ik 
Siong Heng, an astrophysicist at the Uni- 
versity of Glasgow in the United Kingdom. 

China’s space scientists have long tar- 
geted another area: solar physics. Only the 
United States produces more papers in the 
field than China. “But [China’s] papers used 
data from missions developed by Japan and 
the U.S. and elsewhere,” says Gan Weiqun, 
a solar physicist at CAS’s Purple Mountain 
Observatory in Nanjing. He says China’s 
solar scientists have been pushing for their 
own mission for 40 years; they’ve finally 
gotten the nod for the Advanced Space- 
based Solar Observatory (ASO-S). “It’s very 
important for us to make original contribu-
tions in terms of hardware and data,” Gan
says. He explains that ASO-S will be the first 
space observatory to monitor the sun’s mag- 
netic field while watching for solar flares 
and the titanic blasts known as coronal 
mass ejections. The simultaneous observa- 
tions could yield clues to how those erup- 
tions are triggered. 

The last mission included in the phase 2 
list was identified as a priority years ago 
(Science, 22 July 2016, p. 342). The Solar 
wind Magnetosphere Ionosphere Link Ex- 
plorer (SMILE), a joint CAS and European 
Space Agency mission, will pioneer a new 
technique for imaging Earth’s magneto- 
sphere. Previous satellites have made point 
measurements as they traveled through 
the magnetosphere. But scientists recently 
learned that collisions between particles of 
the solar wind and stray particles of Earth’s 
atmosphere produce low energy x-rays that 
light up the magnetosphere. By watch- 
ing these x-rays, SMILE will capture its 
dynamic behavior. 

Last week’s announcement bodes well 
for China’s space science program beyond 
the next four missions. The 4 billion yuan 
($605 million) phase 2 budget includes sup- 
port for development on future missions, 
particularly the enhanced X-ray Timing 
and Polarimetry mission (Science, 9 March, 
p. 1085), an ambitious international project 
led by Chinese scientists to study black holes, 
neutron stars, and magnetars. 

China’s planetary exploration and astro- 
naut programs will continue to make history; 
later this year or early next, for example, it 
plans to land the first probe on the far side of 
the moon. But the future of its space science 
efforts also seems assured. 
TOXICOLOGY


Digital chemical test impresses 


Giant database shows promise for replacing animal studies 


By Vanessa Zainzinger 


Toxicologists this week unveiled a digi-
tal chemical safety screening tool that
could greatly reduce the need for six 
common animal tests. Those tests ac- 
count for nearly 60% of the estimated 
3 million to 4 million animals used 
annually in risk testing worldwide. 

The computerized tool—built on a mas- 
sive database of molecular structures and 
existing safety data—appears to match, and 
sometimes improve upon, the results of ani- 
mal tests for properties such as skin sensi- 
tization and eye irritation, the researchers 
reported in the 11 July issue of Toxicologi- 
cal Sciences. But it also has limitations; for 
instance, the method can’t reliably evaluate 
a chemical’s risk of causing cancer. And it’s 
not clear how open regulatory agencies will be 
to adopting a nonanimal approach. 

Still, “We’re really excited about the potential 
of this model,” says toxicologist Nicole 
Kleinstreuer, the deputy director of a 
center that evaluates alternatives to animal 
testing at the National Institute of Environmental 
Health Sciences (NIEHS) in Durham, 
North Carolina. Kleinstreuer, who was 
not involved in the work, adds that using 
“big data ... to build predictive models is an 
extremely promising avenue for reducing 
and replacing animal testing.” 

Most developed nations require new 
chemicals that enter commerce to undergo 
at least some safety testing. But the long-standing 
practice of exposing rabbits, rats, 
and other animals to chemicals to evaluate 
risks is facing growing public objections 
and cost concerns, helping spur a hunt 
for alternatives. 

One approach is to use what is known 
about the safety of existing chemical 
compounds to predict the risks posed 
by new chemicals with similar molecular 
structures. Two years ago, a team led 
by Thomas Hartung of the Johns Hopkins 
Bloomberg School of Public Health 
in Baltimore, Maryland, took a step toward 
that goal by assembling test data 
on 9800 chemicals regulated by the 
European Chemicals Agency (ECHA) in 
Helsinki. They then showed that chemicals 
with similar structures can have similar 
health effects, such as being an irritant 
(Science, 12 February 2016, p. 651). 

In their new paper, Hartung’s team goes 
bigger. First, the researchers expanded 
their database to 10 million chemical structures 
by adding information from the public 
database PubChem and the U.S. National 
Toxicology Program. Next, they compared 
the structures and toxicological properties 
of every possible pair of compounds 
in their database—a total of 50 trillion 
comparisons—creating a vast similarity 
map that groups compounds by structure 
and effect. Finally, they tested the model: 
They asked it to predict a randomly chosen 
chemical’s toxicological profile by linking 
it to similar “neighbors” on the map, 
and compared the results 
to six actual animal tests of 
the compound. 

“This won’t be the end of animal testing.” 
Mike Rasenberg, European Chemicals Agency 

On average, the computational tool reproduced 
the animal test results 87% of the time. 
That’s better than animal tests themselves can 
do, Hartung says: In reviewing the literature, 
his group found that repeated animal tests replicated 
past results just 81% of the time, on average. 
“This is an important finding,” Hartung says, 
because regulators often expect alternative 
methods to animal testing to be reproducible 
at the 95% threshold—a standard even 
the animal tests aren’t meeting. 
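
The pairwise comparison and neighbor-based prediction described for Hartung's tool can be illustrated with a small sketch. The article does not say how similarity is measured or how neighbors are combined, so the Tanimoto score on sets of structural features and the majority vote among the three closest neighbors are assumptions chosen for illustration, and the compounds and feature names are hypothetical toy data rather than anything from the study.

```python
def pair_count(n: int) -> int:
    """Number of unordered pairs among n compounds: n * (n - 1) / 2."""
    return n * (n - 1) // 2


def tanimoto(a: frozenset, b: frozenset) -> float:
    """Shared features divided by all features found in either compound."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def predict_irritant(query: frozenset, labelled: list, k: int = 3) -> bool:
    """Majority vote among the k labelled compounds most similar to the query."""
    ranked = sorted(labelled, key=lambda item: tanimoto(query, item[0]), reverse=True)
    votes = [label for _, label in ranked[:k]]
    return sum(votes) * 2 > len(votes)


# Hypothetical toy data: each feature set stands in for a structural fingerprint,
# paired with a made-up "is an irritant" label.
KNOWN = [
    (frozenset({"aldehyde", "aromatic_ring"}), True),
    (frozenset({"aldehyde", "alkyl_chain"}), True),
    (frozenset({"aldehyde", "aromatic_ring", "halogen"}), True),
    (frozenset({"alcohol", "alkyl_chain"}), False),
    (frozenset({"ether", "alkyl_chain"}), False),
]

if __name__ == "__main__":
    # 10 million structures give roughly 50 trillion pairs, matching the article.
    print(pair_count(10_000_000))
    print(predict_irritant(frozenset({"aldehyde", "aromatic_ring", "nitro"}), KNOWN))
```

The pair count is the only part tied directly to the reported numbers; the prediction step is a generic nearest-neighbor scheme and says nothing about the accuracy figures quoted above.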

The screening method has weaknesses. 
Although it can predict simple effects such 
as irritation, more complex endpoints such 
as cancer are out of its reach, says Mike 
Rasenberg, who heads the ECHA’s Compu- 
tational Assessment & Dissemination unit. 
“This won’t be the end of animal testing,” 
he predicts. Still, Rasenberg thinks Euro- 
pean regulators will accept the method for 
simple endpoints. 

In the United States, the NIEHS center 
is validating the method, and Kleinstreuer 
says if it passes, she could see U.S. regula- 
tors adopting it for some purposes. In the 
meantime, the researchers are moving to 
make the tool available to companies that 
might want to screen products before sub- 
mitting them for regulatory review. 


Vanessa Zainzinger is a science journalist 
in the United Kingdom. 
GENOME EDITING 


A ‘gene drive’ makes its debut in mammals 


CRISPR-based gene-spreading strategy promises 
to speed development of engineered mice 


By Jon Cohen 


Researchers have used CRISPR, the 
genome editing tool, to speed the 
inheritance of specific genes in 
mammals for the first time. Demonstrated 
in lab-reared insects several 
years ago, this controversial “gene drive” 
strategy offers the ability to quickly 
spread a gene throughout an entire species. 
It has sparked dreams of deploying lethal 
genes to eradicate pests such as malaria- 
carrying mosquitoes—and now, perhaps, 
crop-damaging, disease-causing mammals 
such as rabbits, mice, and rats. 

The new research aims to create novel 
strains of lab mice, not wipe out wild popu- 
lations, and it shows that gene drives work 
less efficiently in rodents than in insects. 
Still, Paul Thomas, a molecular geneticist at 
the University of Adelaide in Australia who 
is working on similar experiments, calls it 
an “important first step towards develop- 
ment of gene drive technology in mammals.” 

The study was posted 4 July on bioRxiv, an 
online site for preprints, by a team at the Uni- 
versity of California, San Diego (UCSD), led 
by geneticist Kimberly Cooper. The research- 
ers, among them Ethan Bier and Valentino 
Gantz, who 3 years ago showed that CRISPR 
could create an efficient gene drive in flies 
(Science, 24 April 2015, p. 442), have submit- 
ted the study to a peer-reviewed journal that 
asked them not to speak with media. But 
it has already triggered plenty of scientific 
discussion. “It’s a very good study and it’s of 
pretty high significance,” says Gaétan Burgio, 
a mouse geneticist at the John Curtin School 
of Medical Research in Canberra, Australia, 
who tweeted a series of comments about the 
report. “Nothing is really known about gene 
drives in rodents. We all assumed the effi- 
ciency would be the same as in flies, but it 
turns out to be very different.” 

In a variation of the earlier fly work, the 
UCSD researchers built their gene drive by 
engineering female mice to carry the gene 
for the DNA-cutting enzyme Cas9, one of 
CRISPR’s two components; they engineered 
males to carry a gene for the other compo- 
nent, the guide RNA (gRNA) that shuttles 
Cas9 to a specific target on a genome, plus 
a gene that modifies coat color. Breeding 
the altered mice created pups that had the 
genes for both CRISPR components on dif- 
ferent chromosomes. 

After Cas9 makes its cut, a cell repairs the 
damage—and how it does so is key to the 
success of a gene drive. The cell can either 
reconnect the severed strands of DNA or 
bridge the gap by slotting in chunks of new 
DNA, a process called homology directed 
repair (HDR). A gene drive harnesses HDR 
to insert a new gene—in this case, a copy of 
the coat-color modifier gene. But cells natu- 
rally prefer to simply reconnect the severed 
DNA, which thwarts a gene drive. 

The UCSD team exploited a fundamental 
biological phenomenon to force cells toward 
HDR. They manipulated Cas9 to turn on during 
meiosis, the cell division process that helps 
create sperm or eggs. The chromosomes 
naturally swap DNA during meiosis, and during 
those exchanges the cell only allows HDR. 

The genome editor CRISPR can be used to engineer 
female lab mice that have increased odds of passing 
down a specific gene to offspring. 

The experiment did not work in males, 
likely because spermatogonia go through 
mitotic cell division before meiosis, stymy- 
ing HDR. But in females, the gene drive 
succeeded. It copied the coat color modi- 
fying gene to the partner chromosome in 
many eggs, which would significantly raise 
the odds of offspring inheriting it. In one 
mouse, 79% of her eggs ended up with the 
color modifying gene on both chromosomes. 
If she mated with a male without the gene, 
about 90% of her pups would inherit the 
gene. (Flies have a different process of em- 
bryogenesis, one that boosts the efficiency 
of HDR and allows it to work in both sexes.) 
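
One way to see how the 79% and 90% figures fit together is the short calculation below. It assumes that germ cells in which the drive did not copy the gene still pass it on half the time, as in ordinary inheritance, and that the father carries no copy. The function name and this reading of the numbers are illustrative assumptions, not taken from the preprint.

```python
def transmission_rate(converted_fraction: float) -> float:
    """Expected share of pups inheriting the gene from a carrier mother.

    converted_fraction is the share of her germ cells in which the drive
    copied the gene onto the partner chromosome, so every egg from those
    cells carries it. The remaining cells are assumed to transmit the gene
    with the usual 50% chance.
    """
    if not 0.0 <= converted_fraction <= 1.0:
        raise ValueError("converted_fraction must lie between 0 and 1")
    return converted_fraction + 0.5 * (1.0 - converted_fraction)


if __name__ == "__main__":
    # 0.79 + 0.5 * 0.21 = 0.895, which rounds to the "about 90%" in the text.
    print(round(transmission_rate(0.79), 3))
```

With no conversion at all the function returns 0.5, the ordinary Mendelian rate, which is the baseline a gene drive is meant to beat.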

Cooper and her colleagues write that this 
system of “active genetic elements” could 
speed the creation of mice that have sev- 
eral introduced or crippled genes. Michael 
Wiles, who heads tech evaluation and de- 
velopment at the Jackson Laboratory in Bar 
Harbor, Maine—one of the world’s largest 
commercial producers of engineered mice— 
says the method could be “very useful,” as 
many human diseases are caused by aberra- 
tions in several genes, and producing mice 
that genetically mimic those diseases is slow 
and laborious. “I’m now getting requests to 
make mice with six modifications, and the 
breeding time becomes phenomenal,” Wiles 
says. With a gene drive like this, what takes 
5 years could be done in one, he says. 

Even though this new work aims only to 
engineer lab mice, Kevin Esvelt, an evolu- 
tionary biologist at the Massachusetts Insti- 
tute of Technology Media Lab in Cambridge, 
says it concerns him. He believes a gene 
drive that could be released into the wild 
should include a kill switch to shut it down 
and restore animals to their natural state. 
“It’s troubling to see that the study does not 
explicitly mention safeguards,” Esvelt says. 

The new UCSD gene drive, however, 
likely would stop spreading in a mouse pop- 
ulation after a few generations. Because the 
genes for Cas9 and the gRNA ride on dif- 
ferent chromosomes, they would gradually 
become separated and the drive would lose 
effectiveness. In the preprint, Cooper and 
her colleagues stress the continuing chal- 
lenges of creating an efficient gene drive for 
wild mammals. “[T]he optimism and con- 
cern that gene drives may soon be used to 
reduce invasive rodent populations in the 
wild is likely premature,” they conclude. 
ARCHAEOLOGY 


Molecular ‘barcodes’ reveal lost whale hunts 


DNA method can rapidly identify species, shows toll of ancient cultures on biodiversity 


By Elizabeth Pennisi and Michael Price 


According to myth, the Maori people 
arrived in New Zealand riding the 
back of a whale; today, whales figure 
prominently in Maori art and 
stories. Now it appears that Maori 
ancestors may also have systematically 
hunted the animals. 

Archaeological sites in New Zealand hold 
few recognizable whale bones, but a new 
technique for detecting species from short 
snippets of DNA in bone scraps has enabled 
researchers to identify the remains of five 
whale species in early New Zealand settle- 
ments, including smaller or slow-moving 
species, the likely targets for 
hunters. The same “DNA meta- 
barcode” method showed that the 
early New Zealanders may also 
have decimated populations of 
a native parrot. And on the op- 
posite side of the world, a team 
used a simpler barcode method to 
suggest that the Romans hunted 
whales too, potentially push- 
ing the first known commercial 
whaling industry back by about a 
thousand years. 

By revealing species from once- 
meaningless bone fragments, bar- 
codes are “opening new windows 
into historical ecosystems,” says 
Ana Rodrigues, an ecologist at 
the University of Montpellier in 
France. And the new data are 
changing views. “For a long time, 
we thought that ancient cultures 
were technologically not advanced 
enough to significantly influence animal 
populations, but more and more evidence 
is accumulating that they actually did,” says 
evolutionary biologist Michael Hofreiter of 
the University of Potsdam in Germany. 

At Curtin University in Perth, Australia, 
DNA researcher Michael Bunce, graduate 
student Frederik Seersholm, and colleagues 
have been assessing the impact of human 
arrival in New Zealand about 750 years ago. 
They developed a way to cheaply and easily 
determine the DNA barcodes of hundreds 
of bones by high throughput sequencing 
of blends of ground-up bones. Barcodes, 
named after the ubiquitous labels on items 
in stores, are short stretches of DNA that 
uniquely identify a species. Bunce’s team 
applied their method to 5000 bone frag- 
ments from 38 sites in New Zealand span- 
ning the past 20,000 years. “It’s bones that 
never would have been identified,” says 
Rodrigues, who was not part of this study. 
Seersholm pinpointed 110 near-perfect 
matches to known species, as the group 
reported this week in the Proceedings of 
the National Academy of Sciences. Until 
now, archaeologists usually assumed that 
whale bones at sites in New Zealand were 
from pilot whales that had been scavenged 
after they beached. But the DNA barcodes 
identified orca, true dolphins, Cuvier’s 
beaked whales, fin whales, and southern 
right whales. “We think they were hunting 
the smaller whales,” driving the mammals 
into shallow water with boats and then 
harpooning them, Seersholm says. 

Scientists retrieved the DNA signatures of whales and other species from bone 
fragments like these in New Zealand. 

The barcodes also suggested the early 
New Zealanders hunted fur seals, sea lions, 
and elephant seals. And the method identi- 
fied several terrestrial species including the 
kakapo, a now-endangered ground-dwelling 
New Zealand parrot. Further analysis of 
kakapo DNA showed that its genetic diver- 
sity plunged once New Zealand was settled. 
“It really shows that high impact humans 
had on the biodiversity,” Seersholm says. 

In the Gibraltar region by the Mediter- 
ranean Sea, Rodrigues and her colleagues 
took a focused approach, analyzing 11 puta- 
tive whale bones from five sites. There, the 
Romans fished tuna from villages dating 
to between 400 B.C.E. and 425 C.E. Cryp- 
tic texts referring to “sea monsters” or 
“ram fishes,” plus large bones suspected 
to be whale (and often shaped into tools), 
suggested the Romans also hunted coast- 
hugging gray and right whales. But these 
behemoths don’t ply those waters today. 

Rodrigues’s team combined tried-and- 
true barcoding with collagen fingerprinting, 
a technique that matches amino acids in 
samples’ connective tissue to known animal 
families. They found that 10 of the 11 bones 
belonged to whales—three gray, three right, 
one fin, one pilot, one sperm, and one dol- 
phin. The preponderance of gray and right 
whales, which are reliably found 
close to shore and thus could have 
been hunted, raises the possibil- 
ity of a forgotten 2000-year-old 
Roman whaling industry, the 
researchers suggest this week 
in the Proceedings of the Royal 
Society B. That’s about 1000 years 
before Basque whalers, long 
considered the first commercial 
whalers, emerged. 

If the Romans did hunt gray 
and right whales from Gibraltar, 
the authors note, they may have 
played a role in the species’ disap- 
pearance from the eastern North 
Atlantic. But other scholars say 
it’s more likely that the Romans 
scavenged beached whales. 

The identification of the 11th 
presumed whale bone threw the 
researchers for a loop: It belonged 
to an elephant. That’s “why it’s 
good to use these methods,” Rodrigues says. 
“Who knows how many other bones out 
there have been misidentified?” 

Barcoding has limitations. It can’t iden- 
tify a species if its sequence isn’t included 
in the barcode database, which is rap- 
idly expanding but still incomplete. And 
barcodes can’t yet show how abundant a 
species was—only that it was present. 

But Rodrigues, for one, is in awe of bar- 
coding’s power. “It’s like a miracle,” she says. 
“You send them a little bit of dust and they 
send you back results.” 


Michael Price is a freelance journalist 
and a magazine editor at San Diego 
State University in California. 
SUNSHINE 


Ammonia made from sun, air, 
and water could turn Australia into 
a renewable energy superpower 


By Robert F. Service 
in Sydney, Brisbane, 
and Melbourne, Australia 


The ancient, arid landscapes of Aus- 
tralia are fertile ground for new 
growth, says Douglas MacFarlane, 
a chemist at Monash University 
in suburban Melbourne: vast for- 
ests of windmills and solar panels. 
More sunlight per square meter 
strikes the country than just about 
any other, and powerful winds buf- 
fet its south and west coasts. All told, Aus- 
tralia boasts a renewable energy potential 
of 25,000 gigawatts, one of the highest in 
the world and about four times the planet’s 
installed electricity production capacity. 
Yet with a small population and few ways 
to store or export the energy, its renewable 
bounty is largely untapped. 

That’s where MacFarlane comes in. For 
the past 4 years, he has been working on 
a fuel cell that can convert renewable elec- 
tricity into a carbon-free fuel: ammonia. 
Fuel cells typically use the energy stored 
in chemical bonds to make electricity; 
MacFarlane’s operates in reverse. In his 
third-floor laboratory, he shows off one of 
the devices, about the size of a hockey puck 
and clad in stainless steel. Two plastic tubes 
on its backside feed it nitrogen gas and wa- 
ter, and a power cord supplies electricity. 
Through a third tube on its front, it silently 
exhales gaseous ammonia, all without the 
heat, pressure, and carbon emissions nor- 
mally needed to make the chemical. “This 
is breathing nitrogen in and breathing am- 
monia out,” MacFarlane says, beaming like 
a proud father. 

Companies around the world already 
produce $60 billion worth of ammonia 
every year, primarily as fertilizer, and 
MacFarlane’s gizmo may allow them to 
make it more efficiently and cleanly. But he 
has ambitions to do much more than help 
farmers. By converting renewable electric- 
ity into an energy-rich gas that can easily 
be cooled and squeezed into a liquid fuel, 
MacFarlane’s fuel cell effectively bottles 
sunshine and wind, turning them into a 
commodity that can be shipped anywhere 
in the world and converted back into 
electricity or hydrogen gas to power fuel 
cell vehicles. The gas bubbling out of the 
fuel cell is colorless, but environmentally, 
MacFarlane says, ammonia is as green as 
can be. “Liquid ammonia is liquid energy,” 
he says. “It’s the sustainable technology 
we need.” 

A component in a reverse fuel cell uses 
renewable power to knit together 
water and nitrogen to make ammonia. 

Ammonia—one nitrogen atom bonded 
to three hydrogen atoms—may not seem 
like an ideal fuel: The chemical, used in 
household cleaners, smells foul and is toxic. 
But its energy density by volume is nearly 
double that of liquid hydrogen—its primary 
competitor as a green alternative fuel— 
and it is easier to ship and distribute. “You 
can store it, ship it, burn it, and convert it 
back into hydrogen and nitrogen,” says Tim 
Hughes, an energy storage researcher with 
manufacturing giant Siemens in Oxford, 
U.K. “In many ways, it’s ideal.” 

Researchers around the globe are chasing 
the same vision of an “ammonia economy,” 
and Australia is positioning itself to lead it. 
“It’s just beginning,” says Alan Finkel, Aus- 
tralia’s chief scientist who is based in Can- 
berra. Federal politicians have yet to offer 
any major legislation in support of renew- 
able ammonia, Finkel says, perhaps un- 
derstandable in a country long wedded to 
exporting coal and natural gas. But last year, 
the Australian Renewable Energy Agency 
declared that creating an export economy 
for renewables is one of its priorities. This 
year, the agency announced AU$20 million 
in initial funds to support renewable export 
technologies, including shipping ammonia. 

In Australia’s states, politicians see re- 
newable ammonia as a potential source 
of local jobs and tax revenues, says Brett 
Cooper, chairman of Renewable Hydrogen, 
a renewable fuels consulting firm in Syd- 
ney. In Queensland, officials are discussing 
creating an ammonia export terminal in 
the port city of Gladstone, already a hub 
for shipping liquefied natural gas to Asia. 
In February, the state of South Austra- 
lia awarded AU$12 million in grants and 
loans to a renewable ammonia project. 
And last year, an international consortium 
announced plans to build a US$10 billion 
combined wind and solar plant known 
as the Asian Renewable Energy Hub in 
Western Australia state. Although most of 
the project’s 9000 megawatts of electric- 
ity would flow through an undersea cable 
to power millions of homes in Indonesia, 
some of that power could be used to gen- 
erate ammonia for long-distance export. 
“Ammonia is the key enabler for exporting 
renewables,” says David Harris, research 
director for low-emissions technologies at 
Australia’s Commonwealth Scientific and 
Industrial Research Organisation (CSIRO) 
Energy in Pullenvale. “It’s the bridge to a 
whole new world.” 

First, however, the evangelists for renew- 
able ammonia will have to displace one of 
the modern world’s biggest, dirtiest, and 
most time-honored industrial processes: 
something called Haber-Bosch. 


THE AMMONIA FACTORY, a metallic metrop- 
olis of pipes and tanks, sits where the red 
rocks of Western Australia’s Pilbara Desert 
meet the ocean. Owned by Yara, the world’s 
biggest producer of ammonia, and com- 
pleted in 2006, the plant is still gleaming. It 
is at the technological vanguard and is one 
of the largest ammonia plants in the world. 
Yet at its core are steel reactors that still use 
a century-old recipe for making ammonia. 
Until 1909, nitrogen-fixing bacteria made 
most of the ammonia on the planet. But that 
year, German scientist Fritz Haber found 
a reaction that, with the aid of iron catalysts, 
could split the tough chemical bond 
that holds together molecules of nitrogen, 
N₂, and combine the atoms with hydrogen 
to make ammonia. The reaction takes brute 
force—up to 250 atmospheres of pressure 
in the tall, narrow steel reactors—a process 
first industrialized by German chemist Carl 
Bosch. The process is fairly efficient; about 
60% of the energy put into the plant ends 
up being stored in the ammonia’s bonds. 
Scaled up to factories the size of Yara’s, the 
process can produce vast amounts of ammonia. 
Today, the facility makes and ships 
850,000 metric tons of ammonia per year— 
more than double the weight of the Empire 
State Building. 

Australia’s windy coasts offer a bounty of energy, which it might one day export as a carbon-free fuel. 

Most is used as fertilizer. Plants crave nitrogen, 
used in building proteins and DNA, 
and ammonia delivers it in a biologically 
available form. Haber-Bosch reactors can 
churn out ammonia much faster than natural 
processes can, and in recent decades 
the technology has enabled farmers to feed 
the world’s exploding population. It’s estimated 
that at least half the nitrogen in the 
human body today comes from a synthetic 
ammonia plant. 

Haber-Bosch led to the Green Revolution, 
but the process is anything but green. 
It requires a source of hydrogen gas (H₂), 
which is stripped away from natural gas or 
coal in a reaction using pressurized, superheated 
steam. Carbon dioxide (CO₂) is left 
behind, accounting for about half the emissions 
from the overall process. The second 
feedstock, N₂, is easily separated from air, 
which is 78% nitrogen. But generating the 
pressure needed to meld hydrogen and 
nitrogen in the reactors consumes more 
fossil fuels, which means more CO₂. The 
emissions add up: Ammonia production 
consumes about 2% of the world’s energy 
and generates 1% of its CO₂. 

Yara is taking a first step toward greening 
that process with a pilot plant, set to open 
in 2019, that will sit next to the existing Pil- 
bara factory. Instead of relying on natural 
gas to make H₂, the new add-on will feed 
power from a 2.5-megawatt solar array into 
a bank of electrolyzers, which split water 
into H₂ and O₂. The facility will still rely on 
the Haber-Bosch reaction to combine the 
hydrogen with nitrogen to make ammonia. 
But the solar-powered hydrogen source 
cuts total CO₂ emissions from the process 
roughly in half. 

13 JULY 2018 • VOL 361 ISSUE 6398 121 

NEWS | FEATURES 

A green way to make ammonia 
Reverse fuel cells can use renewable power to make ammonia from air and water, a far more 
environmentally friendly technique than the industrial Haber-Bosch process. Renewable ammonia 
could serve as fertilizer—ammonia's traditional role—or as an energy-dense fuel. 

Industrial ammonia 
Most of the world’s ammonia is synthesized using Haber-Bosch, a century-old process that 
is fast and fairly efficient. But the factories emit vast amounts of carbon dioxide (CO₂). 

Gentler reactions 
A reverse fuel cell uses renewable electricity to drive a chemical reaction that makes ammonia. Water 
reacts at the anode to make hydrogen ions (H⁺), which migrate to the cathode where they react with 

Other projects are following suit. The 
state of South Australia announced plans 
in February to build a AU$180 million am- 
monia plant, again relying on electrolyzers 
powered by renewable energy. Slated to 
open in 2020, the plant would be a regional 
source of fertilizer and liquid ammonia, 
which can be burned in a turbine or run 
through a fuel cell to make electricity. The 
supply of liquid energy will help stabilize 
the grid in South Australia, which suffered 
a debilitating blackout in 2016. 

Ammonia made this way should attract 
buyers in places such as the European 
Union and California, which have cre- 
ated incentives to buy greener fuels. And 
as the market grows, so will the distribu- 
tion routes for importing ammonia and the 
technologies for using it, Harris says. By 
then, fuel cells like MacFarlane’s could be 
ready to displace Haber-Bosch itself—and 
the half-green approach to ammonia pro- 
duction could become fully green. 


INSTEAD OF APPLYING fearsome heat and 
pressure, reverse fuel cells make ammonia 
by deftly wrangling ions and electrons. As 
in a battery being charged, charged ions 
flow between two electrodes supplied 
with electricity. The anode, covered with 
a catalyst, splits water molecules into O₂, 
hydrogen ions, and electrons. The protons 
flow through an electrolyte and a proton- 
permeable membrane to the cathode, while 
the electrons make the journey through 
a wire. At the cathode, catalysts split N₂ 
molecules and prompt the hydrogen ions 
and electrons to react with nitrogen and 
make ammonia. 

At present, the yields are modest. At room 
temperature and pressure, the fuel cell reac- 
tions generally have efficiencies of between 
1% and 15%, and the throughput is a trickle. 
But MacFarlane has found a way to boost 
efficiencies by changing the electrolyte. 
In the water-based electrolyte that many 
groups use, water molecules sometimes 
react with electrons at the cathode, stealing 
electrons that would otherwise go into mak- 
ing ammonia. “We’re constantly fighting 
having the electrons going into hydrogen,” 
MacFarlane says. 

To minimize that competition, he opted 
for what’s called an ionic liquid electrolyte. 
That approach allows more N₂ and 
less water to sit near the catalysts on the 
cathode, boosting the ammonia produc- 
tion. As a result, the efficiency of the fuel 
cell skyrocketed from below 15% to 60%, 
he and his colleagues reported last year in 
Energy & Environmental Science. The re- 
sult has since improved to 70%, MacFarlane 
says—but with a tradeoff. The ionic liquid 
in his fuel cell is goopy, 10 times more 
viscous than water. Protons have to slog 
their way to the cathode, slowing the rate 
of ammonia production. “That hurts us,” 
MacFarlane says. 
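
The link between efficiency and output can be made concrete with Faraday's law. The sketch below assumes that the "efficiency" quoted here means the share of electrons that end up in ammonia rather than in hydrogen; the article does not define the term. The 1-ampere current and one-hour run are hypothetical operating values, and the cathode half-reaction used is N₂ + 6 H⁺ + 6 e⁻ → 2 NH₃, which means three electrons per ammonia molecule.

```python
FARADAY = 96485.332       # coulombs per mole of electrons
ELECTRONS_PER_NH3 = 3     # N2 + 6 H+ + 6 e- -> 2 NH3
NH3_MOLAR_MASS = 17.031   # grams per mole


def ammonia_grams(current_amps: float, seconds: float, electron_efficiency: float) -> float:
    """Ammonia mass produced when a given share of the electrons reduces nitrogen.

    electron_efficiency is the fraction of delivered electrons that end up in
    ammonia; the remainder is assumed to be lost to side reactions such as
    hydrogen evolution.
    """
    if not 0.0 <= electron_efficiency <= 1.0:
        raise ValueError("electron_efficiency must lie between 0 and 1")
    moles_of_electrons = current_amps * seconds / FARADAY
    moles_of_ammonia = electron_efficiency * moles_of_electrons / ELECTRONS_PER_NH3
    return moles_of_ammonia * NH3_MOLAR_MASS


if __name__ == "__main__":
    # Hypothetical run: 1 ampere for one hour at 15% and at 60% efficiency.
    for efficiency in (0.15, 0.60):
        print(efficiency, round(ammonia_grams(1.0, 3600.0, efficiency), 4))
```

At a fixed current, output scales linearly with this efficiency, so moving from 15% to 60% quadruples the ammonia per unit of charge. The slower proton transport in a viscous electrolyte is a separate limit that shows up as a lower achievable current, which this sketch takes as given.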

To speed things up, MacFarlane and his 
colleagues are toying with their ionic liq- 
uids. In a study published in April in ACS 
Energy Letters, they report devising one 
rich in fluorine, which helps protons pass 
more easily and speeds ammonia produc- 
tion by a factor of 10. But the production 
rate still needs to rise by orders of magni- 
tude before his cells can meet targets, set 
for the field by the U.S. Department of En- 
ergy (DOE), that would begin to challenge 
Haber-Bosch. 

Next to Monash University, Sarb Gid- 
dey and his colleagues at the Clayton 
offices of CSIRO Energy are making am- 
monia with their “membrane reactor.” It 
relies on high temperatures and modest 
pressures—far less than those in a 
Haber-Bosch reactor—that, compared to 
MacFarlane’s cell, boost throughput while 
sacrificing efficiency. The reactor designs 
call for a pair of concentric long metallic 
tubes, heated to 450°C. Into the narrow gap 
between the tubes flows H₂, which could be 
made by a solar- or wind-powered electrolyzer. 
Catalysts lining the gap split the H₂ 
molecules into individual hydrogen atoms, 
which modest pressures then force through 
the atomic lattice of the inner tube wall 
to its hollow core, where piped-in N₂ molecules 
await. A catalytically active metal 
such as palladium lines the inner surface, 
splitting the N₂ and coaxing the hydrogen 
and nitrogen to combine into ammonia— 
much faster than in MacFarlane’s cell. So far 
only a small fraction of the input H₂ reacts 
in any given pass—another knock to the 
reactor’s efficiency. 

Other approaches are in the works. At 
the Colorado School of Mines in Golden, 
researchers led by Ryan O’Hayre are devel- 
oping button-size reverse fuel cells. Made 
from ceramics to withstand high operat- 
ing temperatures, the cell can synthesize 
ammonia at record rates—about 500 times 
faster than MacFarlane’s fuel cell. Like Gid- 
dey’s membrane reactors, the ceramic fuel 
cells sacrifice some efficiency for output. 
Even so, O’Hayre says, they still need to im- 
prove production rates by another factor of 
70 to meet the DOE targets. “We have a lot 
of ideas,” O’Hayre says. 

Whether any of those approaches will 
wind up being both efficient and fast is still 
unknown. “The community is still trying to 
figure out what direction to go,” says Lauren 
Greenlee, a chemical engineer at the Uni- 
versity of Arkansas in Fayetteville. Grigorii 
Soloveichik, a manager in Washington, D.C., 
for the DOE’s Advanced Research Projects 
Agency-Energy program on making renew- 
able fuels, agrees. “To make [green] ammonia 
is not hard,” he says. “Making it economically 
on a large scale is hard.” 


HOWEVER DISTANT, the prospect of Asia- 
bound tankers, full of green Australian am- 
monia, raises the next question. “Once you 
get ammonia to market, how do you get 
the energy out of it?” asks Michael Dolan, 
a chemist at CSIRO Energy in Brisbane. 

The simplest option, Dolan says, is to 
use the green ammonia as fertilizer, like 
today’s ammonia but without the carbon 
penalty. Beyond that, ammonia could be 
converted into electricity in a power plant 
customized to burn ammonia, or in a tradi- 
tional fuel cell, as the South Australia plant 
plans to do. But currently, ammonia’s high- 
est value is as a rich source of hydrogen, 
used to power fuel cell vehicles. Whereas 
ammonia fertilizer sells for about $750 a 
ton, hydrogen for fuel cell vehicles can go 
for more than 10 times that amount. 

In the United States, fuel cell cars seem 
all but dead, vanquished 
by battery-powered ve- 
hicles. But Japan is still 
backing fuel cells heav- 
ily. The country has spent 
more than US$12 billion 
on hydrogen technology 
as part of its strategy 
to reduce fossil fuel im- 
ports and meet its commitment to reduce 
CO₂ emissions under the Paris climate ac- 
cord. Today the country has only about 
2500 fuel cell vehicles on the road. But by 
2030 Japanese officials expect 800,000. 
And the nation is eyeing ammonia as a way 
to fuel them. 

Converting hydrogen into ammonia 
only to convert it back again might seem 
strange. But hydrogen is hard to ship: It 
has to be liquefied by chilling it to temper- 
atures below -253°C, using up a third of its 
energy content. Ammonia, by contrast, liq- 
uefies at -10°C under a bit of pressure. The 
energy penalty of converting the hydrogen 
to ammonia and back is roughly the same 
as chilling hydrogen, Dolan says—and 
because far more infrastructure already 
exists for handling and transporting am- 
monia, he says, ammonia is the safer bet. 

That last step—stripping hydrogen off 
ammonia molecules—is what Dolan and 
his colleagues are working on. In a cavern- 
ous metal warehouse on the CSIRO cam- 
pus that has long been used to study coal 
combustion, two of Dolan’s colleagues are 
assembling a 2-meter-tall reactor that is 
dwarfed by a nearby coal reactor. When 
switched on, the reactor will “crack” ammonia 
into its two constituents: H₂, to be 
gathered up for sale, and N₂, to waft back 
into the air. 

“It looks like there’s enough interest to get this industry started.” 
David Harris, CSIRO Energy 

That reactor is basically a larger version 
of Giddey’s membrane reactor, operating 
in reverse. Only here, gaseous ammonia is 
piped into the space between two concen- 
tric metal tubes. Heat, pressure, and metal 
catalysts break apart ammonia molecules 
and push hydrogen atoms toward the 
tube’s hollow core, where they combine to 
make H₂ that’s sucked out and stored. 

Ultimately, Dolan says, the reactor will 
produce 15 kilograms per day of 99.9999% 
pure hydrogen, enough to power a few fuel 
cell cars. Next month, he plans to demon- 
strate the reactor to automakers, using it 
to fill tanks in a Toyota Mirai and Hyundai 
Nexo, two fuel cell cars. He says his team is 
in late-stage discussions with a company to 
build a commercial pilot plant around the 
technology. “This is a very important piece 
of the jigsaw puzzle,” Cooper says. 
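
As a rough back-of-the-envelope check on the scale of such a reactor, the sketch below estimates how much ammonia feed 15 kilograms of hydrogen per day implies. It assumes the overall cracking reaction 2 NH₃ → N₂ + 3 H₂ and standard atomic masses; the function name and the `conversion` parameter are illustrative assumptions, not figures from Dolan's team.

```python
# Atomic masses in g/mol (standard values, rounded).
M_H = 1.008
M_N = 14.007


def ammonia_feed_kg(hydrogen_kg: float, conversion: float = 1.0) -> float:
    """Ammonia mass (kg) needed to yield `hydrogen_kg` of H2.

    Assumes 2 NH3 -> N2 + 3 H2. `conversion` is a hypothetical fraction
    of the ammonia that is actually cracked (1.0 means ideal conversion).
    """
    if not 0.0 < conversion <= 1.0:
        raise ValueError("conversion must be in (0, 1]")
    m_nh3 = M_N + 3 * M_H          # molar mass of NH3
    h_fraction = 3 * M_H / m_nh3   # mass fraction of hydrogen in NH3
    return hydrogen_kg / (h_fraction * conversion)


if __name__ == "__main__":
    feed = ammonia_feed_kg(15.0)
    print(f"{feed:.1f} kg NH3 per day for 15 kg H2 (ideal conversion)")
```

Under these assumptions, hydrogen makes up a little under 18% of ammonia by mass, so 15 kilograms of hydrogen corresponds to roughly 84 kilograms of ammonia per day. Any real reactor would need somewhat more.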

Beyond 2030, Japan will likely import 
between $10 billion and $20 billion of 
hydrogen each year, according to a re- 
newable energy road- 
map recently published 
by Japan’s Ministry of 
Economy, Trade and 
Industry. Japan, Singa- 
pore, and South Korea 
have all begun discus- 
sions with Australian of- 
ficials about setting up 
ports for importing renewably produced 
hydrogen or ammonia. “How it all comes 
together economically, I don’t know,” Har- 
ris says. “But it looks like there’s enough 
interest to get this industry started.” 

Cooper knows how he wants it to end. 
Over coffee on a rainy morning in Sydney, 
he describes his futuristic vision for renew- 
able ammonia. When he squints, he can see, 
maybe 30 years down the road, Australia’s 
coast dotted with supertankers, docked 
at offshore rigs. But they wouldn’t be fill- 
ing up with oil. Seafloor powerlines would 
carry renewable electricity to the rigs from 
wind and solar farms on shore. On board, 
one device would use the electricity to de- 
salinate seawater and pass the fresh water 
to electrolyzers to produce hydrogen. An- 
other device would filter nitrogen from the 
sky. Reverse fuel cells would knit the two 
together into ammonia for loading on the 
tankers—a bounty of energy from the sun, 
air, and sea. 

It’s the dream that nuclear fusion never 
reached, he says: inexhaustible carbon-free 
power, only this time from ammonia. “It 
can never run out, and there is no carbon 
in the system.” 

When persistence doesn’t pay 


Rats, mice, and humans all invest more time in a foraging 
task than is in their interest 


By Sarah F. Brosnan 


People routinely make bad decisions. 
Far from being random, however, 
these bad decisions are often predictable, 
occurring reliably in specific contexts (1-3). 
One set of such suboptimal 
decisions are cognitive biases, wherein 
individuals make decisions that predictably 
violate rationality or their own best interests 
without a logical reason for doing so (4). One 
such bias is the sunk cost bias, in which individuals 
invest more time or resources in an 
outcome than it is worth given the potential 
gain, presumably because they focus on hindsight 
evaluation of irrecoverable costs rather 
than prospective gains (5). On page 178 of 
this issue, Sweis et al. (6) report a clever experiment 
demonstrating the sunk cost bias 
in rats, mice, and humans. The findings also 
provide insight into why results from previous 
studies have been so variable. 

124 • 13 JULY 2018 • VOL 361 ISSUE 6398 

Published by AAAS 

Cognitive biases can be substantial impediments 
to optimal decision-making and are 
costly to individuals and society. Comparing 
responses across different species is key to 
understanding how these biases evolved and 
will help to both predict when they will man- 
ifest and offer suggestions as to how to offset 
them. However, determining how the sunk 
cost effect evolved has not been easy, be- 
cause the bias is not consistently expressed 
in other species (7-9). 

Studying the sunk cost bias is challenging 
because it is difficult to separate investment 
from future expectations, especially in field 
studies. Lab experiments can more easily dis- 
tinguish between these factors, but lab exper- 
iment setups can differ substantially across 
species, particularly between humans and 
other species, making comparisons difficult. 
Sweis et al. overcome the latter problem by 
using the same basic procedure, based on a 
foraging task, across all three species. 

When foraging, animals must decide whether to 
continue in the current patch or move on. Staying too 
long is an example of the sunk cost bias, which 
Sweis et al. demonstrate in rats, mice, and humans. 

In their experiment, subjects had a lim- 
ited time budget during which they rotated 
through four reward options that differed in 
how well the subjects liked them (subjects 
showed consistent preferences across the 
study). At each transition, subjects first had 
to decide whether to commit to the available 
option or move to the next option. If they 
committed, they could not get the reward 
until they had waited for a predetermined 
period of time; the time remaining was indi- 
cated to the subject throughout the waiting 
period. Subjects could leave at any point dur- 
ing the waiting period, without penalty, and 
move to the next option in the rotation. 

A subject not showing the sunk cost bias 
should leave quickly if a less preferred re- 
ward is paired with a long wait time. None- 
theless, subjects of all species were more 
committed to sticking with their choices 
than their preferences warranted, thus dem- 
onstrating a sunk cost bias. Indeed, this bias 
grew stronger the longer they waited; that 
is, it increased with increasing investment. 
However, it was unaffected by the time spent 
making the choice initially. Supplemental 
work indicates that there is a distinct deci- 
sion phase during which sunk costs do not 
accumulate, even when this phase is not ex- 
plicitly included as a separate period in the 
experimental design. The authors argue that 
this is evidence for two independent valua- 
tion processes, one assessing whether to ac- 
cept an offer and the other—the one that is 
susceptible to sunk costs—assessing whether 
to continue investing in the choice. 
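
One way to make the distinction concrete is to compare two hypothetical decision rules at the same remaining wait but different amounts of time already invested. The sketch below is illustrative only and is not the analysis used by Sweis et al.; the `Offer` class, both rule functions, and the parameters `max_wait_per_value` and `bias` are assumptions introduced here.

```python
from dataclasses import dataclass


@dataclass
class Offer:
    value: float  # subjective value of the reward (higher = more preferred)
    wait: int     # total wait in seconds before the reward is delivered


def stays_prospective(offer: Offer, invested: int,
                      max_wait_per_value: float = 10.0) -> bool:
    """Hypothetical agent without sunk cost bias.

    Only the remaining wait matters; time already invested is ignored.
    """
    remaining = offer.wait - invested
    return remaining <= max_wait_per_value * offer.value


def stays_sunk_cost(offer: Offer, invested: int,
                    max_wait_per_value: float = 10.0,
                    bias: float = 0.5) -> bool:
    """Hypothetical agent with sunk cost bias.

    Time already spent waiting raises the remaining wait it will tolerate.
    Time spent in the initial decision phase is not counted as invested.
    """
    remaining = offer.wait - invested
    return remaining <= max_wait_per_value * offer.value + bias * invested


if __name__ == "__main__":
    # Two situations with the same 25 s remaining but different investment.
    early = (Offer(value=1.0, wait=30), 5)
    late = (Offer(value=1.0, wait=65), 40)
    for name, (offer, invested) in (("early", early), ("late", late)):
        print(name,
              "prospective stays:", stays_prospective(offer, invested),
              "| sunk-cost stays:", stays_sunk_cost(offer, invested))
```

With the same time remaining, the prospective rule gives the same answer in both situations, whereas the biased rule becomes more willing to stay as the investment grows. That dependence on past investment is the signature described above.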

The evidence for two different valuation 
systems working together offers the possibil- 
ity of a comprehensive explanation for the 
sunk cost effect that integrates cognition, be- 
havior, and neural architecture across species 
and contexts. Moreover, these results hint at 
some of the reasons for the previous dispa- 
rate results. Perhaps wait times were not long 
enough to get past the assessment period, or, 
more speculatively, decisions may change if 
subjects do not know how long they must 
wait or if they are otherwise engaged in an 
active task (such as lever pressing) during the 
waiting period. 

A key reason for Sweis et al.’s success is the 
degree to which they worked to provide a rel- 
evant, comparable methodology. They chose 
a context, foraging, that had ecological rel- 
evance to these species in the context of sunk 
costs and came up with a clever way of providing 
information about how much future 
investment remained to minimize informa- 
tion uncertainty. Importantly, they provided 
the humans and the rodents with the same 
procedures and information. 

Of course, there were differences among 
species. Differences in training between the 
rodent labs required control tests to verify 
that behavior was truly the same. Humans, 
unlike rodents, are not consistently food 
motivated, and so they “foraged” for enter- 
tainment, in the form of movie clips, instead. 
Such adaptations to account for species-typ- 
ical behavior and differing environments are 
nearly unavoidable in comparative work, but 
it is critical that the fundamental procedure 
is the same and that the limitations of one’s 
study are acknowledged (10), as in Sweis et al. 
For instance, the authors chose film clips be- 
cause, like the differently flavored food, they 
differ in value but not handling time, which 
allows for a direct comparison across the four 
choice options. 

Why are we susceptible to the sunk cost 
bias? In some situations, it may be difficult 
to evaluate future costs; in these cases, past 
efforts can be a reasonable proxy. If this heu- 
ristic worked well enough most of the time, 
there would not have been strong selection 
pressure to evolve a more sophisticated strat- 
egy. Indeed, many human decision-making 
biases likely result from a trade-off between 
the costs of acquiring and processing information 
and the costs of getting it wrong (11). 
Understanding this evolutionary trade-off 
can help us to better understand why hu- 
mans make the decisions that we do and, ul- 
timately, suggest ways of improving human 
decision-making (12). Sweis et al.’s adaptable 
design is likely to prove helpful in future 
experiments investigating sunk cost biases 
across a variety of species and contexts. 

EVOLUTION 


Many roads to convergence 


Plant genomes highlight 
complex mechanisms behind 
evolutionary convergence 


By László G. Nagy 


Many plants form specialized symbiotic 
root structures, called nodules, 
that harbor beneficial associations 
with nitrogen-fixing bacteria in 
the genera Rhizobium or Frankia 
(see the photo). How this nitrogen- 
fixing root nodule (NFN) symbiosis arose 
repeatedly during plant evolution is an age- 
old mystery: It shows signatures of conver- 
gence (the repeated emergence of similarity 
during evolution) yet builds on similar gene 
sets in phylogenetically distant plants. On 
page 144 of this issue, Griesmann et al. (1) 
sequenced the genomes of 10 plant species 
to reveal the genetic correlates of the ori- 
gin and loss of NFN symbiosis. Their work 
reveals intricate gain and loss patterns of 
symbiosis-associated genes, calling for new 
models to explain convergent evolution. 
NFN symbiosis has an immense impact 
on plant growth and global nitrogen cy- 
cling but occurs only in four related plant 
orders, the Fabales, Fagales, Cucurbitales, 
and Rosales, collectively known as the NFN 
clade. Only a few phylogenetically diverse 
members of the NFN clade form symbio- 
sis, making NFN symbiosis a phylogeneti- 
cally patchy character (2, 3). Despite its 
phylogenetic patchiness, nodule formation 
builds on surprisingly similar gene sets (4), 
with some key symbiosis-associated genes, 
such as NODULE INCEPTION (NIN) and 
RHIZOBIUM-DIRECTED POLAR GROWTH 
(RPG), being widely conserved across 
plants. Besides NIN and RPG, another 
290 genes were recently found to be up- 
regulated in nodules of Medicago truncatula 
and Parasponia andersonii, two plant spe- 
cies that independently evolved the ability 
to form symbiosis. This is surprising, given 
that these species diverged ~100 million 
years ago and belong to two separate or- 
ders, the Fabales and Rosales, respectively 
(4). How can this level of genetic similarity 
be reconciled with the phylogenetic patchiness 
of nodulation? Griesmann et al. outlined 
three possible evolutionary scenarios 
to describe the extant distribution of NFN 
symbiosis across the plant tree of life: It 
could be described by up to 16 independent 
evolutionary origins, a single origin with 
multiple losses, or an intermediate scenario 
combining homologous and convergently 
evolved components of the underlying ge- 
netics. The intermediate scenario invokes a 
predisposition event, the gain of a genetic 
element (gene, regulatory interaction, or 
other), that made descendant lineages more 
likely to evolve NFN symbiosis (5). 

Griesmann et al. sequenced the genomes 
of 10 carefully selected plant spe- 
cies to cover several clades that 
independently gained the ability to 
form NFN symbiosis. The authors 
embarked on a sizable challenge 
to identify genomic evidence for 
a postulated predisposition event, 
either for independent gains or for 
independent losses. Searching mul- 
tiple genomes for genetic changes 
underlying the evolution of a trait is 
like finding a needle in a haystack, 
although incorporating the gain 
and loss patterns of the trait across 
the phylogeny can help narrow the 
search space down (6). Indeed, the 
authors found several genes, such 
as NIN and RPG, that were lost in 
clades that lost the ability to form 
NFN symbiosis. Such losses have important 
implications for genetic engineering 
of NFN symbiotic plants. 
In nature, the symbiosis can be lost 
in nitrogen-rich habitats, where the 
cost of forming symbiosis outweighs 
its benefits, making symbiosis-asso- 
ciated genes dispensable. In agricul- 
tural settings, however, long-term 
fertilizer application might lead to 
similar losses of the ability to nodu- 
late, which could be exploited with 
the development of engineered NFN 
symbiotic crop plants in regions where fer- 
tilizer availability is limited. 

Another 52 gene families showed parallel 
expansions in multiple symbiotic subclades 
of the NFN clade and contained known 
nodulation-associated genes of Medicago, 
although most gene family expansions were 
specific to individual subclades. These ob- 
servations would make NFN symbiosis a 
classic example of convergent evolution: 
independent acquisitions of similar func- 
tions via different evolutionary routes, with 
dissimilar genetic backgrounds. However, 
many symbiosis-associated genes predate 
the origin of the NFN clade and are uni- 
versally up-regulated during nodule forma- 
tion in independent clades (4), suggesting 
a more complex scenario involving homology 
that dates back to early, nonnodulating 
precursors. 

A colored scanning electron micrograph of a bean plant root nodule 
shows nitrogen-fixing Rhizobium bacteria (green) embedded in plant 
tissue, forming a symbiotic relationship (magnification ×50). 

Convergently evolved traits involving a 
similar genetic background are hard to ex- 
plain. Although the multiple-origins model 
is often the phylogenetically most parsi- 
monious for such traits, the likelihood of 
parallel changes affecting tens to hundreds 
of the same genes is so low that multiple in- 
dependent origins seem implausible. More 
complex models, involving deep homolo- 
gies (7), latent homologies (8), predisposi- 
tion events, or parallel co-option of genes 
have been proposed to explain the evolution 
of phylogenetically patchy traits, like 
NFN symbiosis. Although these have been 
used to denote various, sometimes overlap- 
ping processes, they refer to at least two 
distinct evolutionary mechanisms. Traits 
are deeply homologous if homology is not 
evident at the level of the entire trait but 
historical continuity of shared function ex- 
ists for at least some of the genes involved 
in its development. Arthropod and verte- 
brate eyes are textbook examples of deep 
homologies: They develop by largely nonho- 
mologous mechanisms but build on homol- 
ogous genes, such as opsins, developmental 
transcription factors, and photoreceptor 
cells that can be traced back to primor- 
dial photosensory organs of a cnidarian or 
bilaterian ancestor (9). In this case, both 
function and genetics are homologous, but 
evidence for this is blurred by divergence 
and/or lineage-specific refinements, overall 
leading to apparent convergence. 

Conversely, latent homologies are de- 
velopmental potentials (for example, gene 
regulatory circuits) that can be deployed 
(through co-option or exaptation) for new 
functions by simple genetic changes. Be- 
cause a few mutations (for example, in 
regulatory sequences) are sufficient for 
their expression in a new context, latent 
homologies reduce the mutational target 
size for evolution and can thus promote 
convergence. Mechanistically, pre- 
disposition events can comprise the 
evolution of latent homologies that, 
as hypothesized for the NFN clade 
(5), lead to a higher likelihood for 
convergence to occur. 

Teasing apart the possible mecha- 
nisms behind convergently evolved 
traits remains a substantial chal- 
lenge even in the era of genomics. 
It nevertheless appears that case 
studies and models are emerging 
to explain the pervasive occur- 
rence of convergence across the 
tree of life. NFN symbiosis rep- 
resents one particularly intricate 
case of phylogenetically patchy 
traits. The phylogenomic compari- 
sons by Griesmann et al. provide 
important clues about the genetics 
of the repeated emergence of NFN 
symbiosis and the genes that are 
repeatedly lost in secondarily non- 
symbiotic clades, although some 
ambiguity remains as to the con- 
vergent origins. With the increas- 
ing accessibility of whole-genome 
sequences, such comparisons will 
likely play an increasing role in 
dissecting the genetic background 
of complex multigenic traits, such 
as nodulation, and decoding long- 
standing evolutionary questions, including 
the many roads that can lead to evolved 
similarity in disparate branches of the tree 
of life. 

CHEMISTRY 


Femtosecond structural photobiology 


Time-resolved crystallography reveals how bacteriorhodopsin uses light to drive chemistry 


By Keith Moffat 


Absorption of a photon in the visible 
region of the solar spectrum is the 
fundamental step in the photochem- 
ical reactions that drive biological 
processes such as imaging, photo- 
synthesis, and sensory perception. 
All known photoreceptors are proteins that 
contain a chromophore: a small organic 
molecule that absorbs in the visible region 
of the spectrum (1). The bacterial integral 
membrane protein bacteriorhodopsin (bR) 
uses retinal as its chromophore (see the 
figure), capturing the energy in a photon 
to drive retinal isomerization 
and pump protons across the 
membrane. bR is a readily accessible 
analog to the visual rhodopsins 
in, for example, the human eye. On page 145 
of this issue, Nogly et al. (2) 
used time-resolved x-ray crystallography 
(3, 4) to determine 
intermediate structures of bR 
at near-atomic resolution on 
the femtosecond to picosecond 
time scales at which isomerization occurs. 

Tracking femtosecond retinal changes 
Difference electron density maps after the pump laser pulse show the retinal 
structure in bacteriorhodopsin (bR) before isomerization (top) and during 
isomerization (bottom). 

Time bin one (49 to 406 fs) 
Small structural changes in the initial electronic excited 
state occur after a photon is absorbed but before 
isomerization has occurred. 

Time bin two (457 to 646 fs) 
In the J state, isomerization is half complete, with 
a torsion angle of 90° about the C13=C14 bond. 

Negative difference density 

Department of Biochemistry and Molecular Biology and 
Institute for Biophysical Dynamics, University of Chicago, 
Chicago, IL 60615, USA. Email: moffat@cars.uchicago.edu 

SCIENCE sciencemag.org 

The photochemistry of reti- 
nal in bR differs substantially 
from that of free retinal. In- 
teractions between the retinal 
and its protein surroundings 
tightly constrain the chromo- 
phore and confer specificity 
in trans-to-cis isomerization 
about the C13=C14 bond, high 
quantum yield, and efficient 
proton pumping. These constraints must 
be very selective. Isomerization requires 
carefully choreographed, ultrafast atomic 
motions that have evolved to direct the re- 
action along the desired path and inhibit 
all undesired reactions, such as isomeriza- 
tion about a different bond or rapid rever- 
sion to the ground state. 

How is this molecular choreography 
achieved? Extensive earlier studies (5) 
have explored this question by using fem- 
tosecond optical spectroscopies; quantum 
mechanical, molecular mechanics, and 
molecular dynamics calculations; and 
structural probes with low spatial and 
temporal resolution. Spectroscopic data 
provide a simple linear model for the reac- 
tion path. Explicit structural information 
on the femtosecond time scale is needed to 
develop a more detailed picture. 

Nogly et al. now provide these data in 
the face of major experimental and ana- 
lytical difficulties. The structural changes 
that occur during isomerization are small; 
few atoms move, and those that do move 
do not move very far. The time-dependent 
changes in x-ray scattering that constitute 
the raw data are therefore also small, yet 
their accurate measurement is essential. 
The authors’ visible laser pump-hard x-ray 
probe experiments are analogous to mak- 
ing a movie one frame at a time. A light 
pulse from a visible laser—the pump—ini- 
tiates the reaction in the bR molecules in a 
tiny crystal. After a controlled time delay, 
the crystal is illuminated by an extremely 
intense pulse from an x-ray laser—the 
probe. By obtaining probe data over all 
crystal orientations, the authors determine 
the structure of the bR molecules at that 
time delay. By varying the time delay, the 
“light” structures are determined frame 
by frame—the reaction path. If the visible 
laser pulse is omitted, no reaction occurs, 
and the “dark” structure is obtained. 

Synchrotron x-ray sources at which such 
experiments were pioneered are restricted 
to a time resolution of ~100 ps (6). Nogly 
et al. now access isomerization on the fem- 
tosecond to picosecond time scale by us- 
ing the extremely intense, femtosecond, 
hard x-ray pulses from the Linac Coherent 
Light Source at Stanford, the world’s first 
hard x-ray free-electron laser (XFEL) (7, 8). 
However, XFELs are based on noise ampli- 
fication and generate x-ray scattering data 
that are prone to systematic errors. Each 
tiny bR crystal is destroyed by the extremely 
intense XFEL pulse immediately after gen- 
erating its diffraction pattern. 

Nogly et al. present differ- 
ence electron density maps 
that minimize systematic error 
by directly comparing the x-ray 
scattering from the light struc- 
ture (at a known, variable time 
delay after the pump pulse) 
and dark structure (before the 
pump pulse). Such difference 
maps are highly sensitive to 
small structural differences 
(9). To obtain atomic struc- 
tures, the authors converted 
the difference maps into time- 
dependent, extrapolated den- 
sity maps and further averaged 
these maps into time bins cen- 
tered on the anticipated time 
of maximum concentration of 
the I, J, and K spectroscopic 
states. They then refined a 
single atomic structure in each 
extrapolated density map. 
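
The light-minus-dark comparison and the averaging into time bins can be sketched in a few lines. This is a toy illustration with made-up one-dimensional signals, not the authors' crystallographic pipeline; the function names, the data layout, and the use of the figure's two time bins (49 to 406 fs and 457 to 646 fs) as bin edges are assumptions made here.

```python
from statistics import mean


def difference_signal(light, dark):
    """Pointwise light-minus-dark difference for two equal-length signals."""
    if len(light) != len(dark):
        raise ValueError("light and dark signals must have equal length")
    return [l - d for l, d in zip(light, dark)]


def average_in_bins(frames, bins):
    """Average difference signals whose time delay falls inside each bin.

    frames: list of (delay_fs, signal) pairs
    bins:   dict mapping a label to an inclusive (start_fs, end_fs) range
    Bins that contain no frames are omitted from the result.
    """
    averaged = {}
    for label, (start, end) in bins.items():
        selected = [sig for delay, sig in frames if start <= delay <= end]
        if selected:
            # zip(*selected) regroups the values point by point across frames
            averaged[label] = [mean(vals) for vals in zip(*selected)]
    return averaged


if __name__ == "__main__":
    dark = [1.0, 2.0, 3.0]                   # made-up reference signal
    frames = [
        (100, difference_signal([1.5, 2.0, 2.5], dark)),
        (300, difference_signal([1.25, 2.0, 2.75], dark)),
        (500, difference_signal([1.0, 3.0, 3.0], dark)),
    ]
    bins = {"bin one": (49, 406), "bin two": (457, 646)}
    print(average_in_bins(frames, bins))
```

Subtracting the dark signal first means that errors common to both measurements cancel, and averaging within a bin trades time resolution for a cleaner signal.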

Two effects may complicate 
the accuracy of refinement. First, in any 
dynamic experiment, an individual time 
delay—here, a time bin—typically contains 
a mixture of structures (3, 4). Second, ab- 
sorption of a photon by retinal generates 
a vibrationally excited molecule far from 
equilibrium, with unusual bond lengths 
and angles. Reassuringly, the single refined 
structures are chemically plausible and 
consistent with both calculations (2) and 
the previously proposed aborted bicycle 
pedal model for isomerization (10, 11). 

The difference maps are of interest both 
for the atomic motions they reveal and for 
those they do not reveal. Certain protein 
atoms comprising the retinal pocket in the 
dark do not move; they impose constraints 


bR refined 
intermediates 
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on retinal motion throughout isomerization 
(see the figure). Other atoms and groups 
move in concert, restricting isomerization 
to the C13=C14 bond. A counter-ion cluster 
consisting of Asp85, Asp212, and a key buried 
water molecule (Wat402) stabilizes the positive 
charge on the Schiff base in the dark 
state. Even before isomerization in the 
electronically excited state, this cluster is 
disrupted, and specific hydrogen bonds are 
weakened. The positive charge on the Schiff 
base is redistributed along the polyene 
chain. The alternation of single and double 
bonds along the polyene chain is perturbed 
and at least partial single-bond character 
conferred on the C13 to C14 bond (a double 
bond in the dark state) to facilitate its twist- 
ing as isomerization begins. 

In bR, isomerization appears to be half 
complete (with a torsion angle of 90° 
about the C13=C14 bond) at ~500 fs. Isom- 
erization has been studied with femtosec- 
ond time-resolved crystallography in two 
other proteins: photoactive yellow protein, 
whose small chromophore contains only a 
single double bond about which isomeriza- 
tion is also half complete at ~500 fs (12), 
and a photoswitchable fluorescent protein, 
in which a larger chromophore must ex- 
ecute molecular contortions to accomplish 
isomerization in ~1 ps (13). It is likely that 
electronic and structural similarities ex- 
ist in the earliest steps of the elementary 
chemical process of isomerization in differ- 
ent chromophores. 

Challenging questions remain. Is the na- 
ture of the cis-to-trans isomerization path- 
way in the physiologically important visual 
rhodopsins simply the reverse of the trans- 
to-cis pathway seen here in bR? Are the 
structures and charge distributions of the 
retinal pockets in the trans and cis species 
in bR and visual rhodopsins closely similar? 
Nogly et al. show that the techniques are 
firmly in place to address these questions. 

Double trouble at the beginning of life 


Dual-spindle assembly in early embryos can 
compromise mammalian development 


By Agata P. Zielinska and Melina Schuh 


Every human life begins with the fertilization 
of an egg (1). Once the egg 
and the sperm have fused, the parental 
chromosomes need to be united. 
To this end, the egg and sperm chromosomes 
are first packaged into two 
separate membrane-enclosed nuclei. These 
nuclei then slowly move toward each other 
and break down in the center of the fertil- 
ized egg, called the zygote. Only then are 
the maternal and paternal chromosomes 
united—but not quite. Surprisingly, the pa- 
rental chromosomes do not mix immediately 
but instead occupy distinct territories in 
the zygote throughout the first cellular divi- 
sion (2, 3). How the autonomy of parental 
genomes is retained after fertilization has 
remained unclear. On page 189 of this issue, 
Reichmann et al. (4) used elegant microscopy 
methods to illuminate this special moment, 
when the parental chromosomes first meet in 
live mouse zygotes, and follow how the chro- 
mosomes become distributed as the zygote 
divides. Their findings reveal an unexpected 
mechanism that keeps the parental genomes 
apart during the first division of the embryo: 
The male and female chromosomes each as- 
semble their own chromosome separation 
machineries. This increases the probability 
that chromosomes are separated into multi- 
ple, unequal groups, which may compromise 
embryo development and give rise to sponta- 
neous miscarriage. 

Reichmann et al. labeled the maternal 
and paternal chromosomes in different 
colors by taking advantage of distinct DNA 
sequences (5) in the parental chromosomes, 
which came from different mouse strains 
(6). To follow in detail how the chromo- 
somes are united, zygotes have to be imaged 
at very high spatial and temporal resolu- 
tion. However, embryos are light sensitive, 
which has hindered a detailed analysis in 
the past. The authors overcame this limi- 
tation by using an innovative light-sheet 
microscope, which illuminates the embryo 
selectively in the region of interest but not 
in adjacent regions, as is the case with stan- 
dard microscopy approaches (7). This re- 
duces the amount of light that the embryo 
is exposed to. Moreover, this method is fast 
and allowed the authors to reconstruct the 
entire volume that is occupied by the chro- 
mosomes with unprecedented spatial and 
temporal resolution. 

The authors tracked the maternal and 
paternal chromosomes as they first met. In 
addition, they imaged microtubules, the pro- 
teinaceous fibers that form the spindle ap- 
paratus that captures, aligns, and distributes 
the chromosomes equally between the two 
daughter cells of the dividing zygote. Surpris- 
ingly, Reichmann et al. found that the mater- 
nal and paternal chromosome masses each 
assemble a separate spindle structure that 
autonomously initiates chromosome align- 
ment (see the images). Later, the two spindles 
merge into a single spindle. However, the ma- 
ternal and paternal chromosomes remain in 
separate regions of the merged spindle and 
do not mix. 

Immunofluorescence staining of a mouse zygote 
(left image) shows parallel alignment of two mitotic 
spindles (green). This is overlaid (right image) 
with microtubule organizing centers (magenta), sites 
of microtubule binding to chromosomes (gray), 
and chromosomes (blue). 


Whether the spatial separation of paren- 
tal chromosomes has any advantages for the 
developing mammalian embryo is unclear. 
However, the fact that zygotes have a dual 
spindle creates a previously unforeseen 
source of potential error. The final task the 
zygote has to accomplish before it divides 
is to align the two spindle axes in parallel 
to each other, so that the two spindles can 
merge into a compact dual structure. If the 
poles of the spindles fail to align and merge, 
the genetic material of the zygote could be 
pulled into three or four directions 
instead of two (see the 
figure). Reichmann et al. demonstrate 
that spindle misalignment 
leads to the formation of 
multinucleated embryos that 
have more than one nucleus per 
cell. It is easy to envision that 
such an undesired partition- 
ing of the DNA would have a 
negative impact on the fidel- 
ity of subsequent cell divisions 
and hence might compromise 
embryo development. Indeed, 
experience from in vitro fertil- 
ization (IVF) clinics shows that 
early human embryos cultured 
in vitro before implantation fre- 
quently have multiple nuclei in 
their cells (8) and then fail to 
develop further (9). 

permissive environment for the formation of 
two independent spindle structures. 

The presence of centrosomes in human zy- 
gotes could facilitate the assembly of a single 
bipolar spindle. However, it is still unclear 
whether spindle assembly in human zygotes 
is primarily centrosome driven, or whether 
they also assemble two distinct spindles from 
chromosome surfaces. Interestingly, spin- 
dle assembly in unfertilized human eggs is 
largely chromosome driven (13), which raises 
the possibility that also in human zygotes, 
even in the presence of centrosomes, chro- 
mosome-driven spindle assembly may occur. 

In many countries, the law considers that 
new human life begins when parental chro- 
mosomes are united upon fertilization (14). 
This work is a reminder that the definition 
of the unification of parental genomes as the 
beginning of life is not as clear cut as was pre- 
viously assumed. How we define the “begin- 
ning of life” not only has ethical implications 
but also practical repercussions for fertility 
treatments. This is because in some countries, 
such as Germany (14), parental nuclei are al- 
lowed to merge and hence life is allowed to 
“begin” in IVF clinics only in a very limited 
number of in vitro-cultured zygotes, all of 
which have to be transferred to the mother. 
This policy increases the rate of multiparous 
pregnancies, which are associated with se- 
vere risks to the health of the mother and her 
children. Additionally, because identification 
of the most promising fertilized eggs has to 
legally occur before the parental genomes 
have merged (so that embryos with merged 
genomes, defined as being human life, are 
not discarded), embryologists have to select 
embryos for implantation at the 
zygotic stage. Because our under- 
standing of early human embryo- 

To evaluate the implications of 
these findings for human infertil- 
ity, it is important to investigate 
whether cells with multiple nu- 
clei in human embryos are also 
caused by defects during dual- 
spindle merging, as reported for 
mouse embryos in this study. In 
humans, the sperm delivers a 
centrosome into the zygote (JO, 
11). Centrosomes are microtu- 
bule-organizing centers that help 
to generate a bipolar spindle in 
somatic cells (12). However, in 
mice, the sperm does not carry 
a centrosome, and the zygotic 
spindle needs to assemble with- 
out the help of centrosomes (1). 
Reichmann et al. demonstrate 
that chromosomes play a key 
role in driving spindle assembly 
in the mouse zygote. The chro- 
mosome-based spindle assembly 


Together, but still apart 


Upon fertilization, parental chromosomes need to be united. Unexpectedly, in 


mice, male and female chromosomes are kept apart on separate mitotic spindles 


during the first division of the zygote. This increases the probability of forming 
multinucleated cells. 
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genesis is still poor, it is difficult 
to select at this early stage the 
embryos that have the high- 
est chance of dividing correctly, 
implanting, and giving rise to a 
healthy pregnancy. Considering 
that there is now comprehensive 
evidence that the parental DNA 
does not mix in live zygotes, this 
study highlights that a dialogue 
is needed to reshape the legal 
definition of when life begins, so 
that it is supported by up-to-date 
scientific knowledge. By revisit- 
ing this definition, we could in- 
crease the chances of a healthy 
pregnancy among couples who 
otherwise struggle to conceive. 
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INFECTIOUS DISEASES 


Eradication genomics— 
lessons for parasite control 


Genomic surveillance could help achieve targets 
for the elimination of tropical diseases 


By James A. Cotton, Matthew Berriman, 
Love Dalén, Ian Barnes 


Large-scale programs are seeking to control or eliminate infectious diseases 
with the greatest impact on global health. Many of these efforts target 
the neglected tropical diseases (NTDs) that disproportionately affect the lives 
of the poor. Often the aim is to eradicate the 
causative pathogens. The idea—inspired by 
the success of smallpox eradication in the 
1960s and 1970s—is that a large-scale, but 
time-limited, effort could eliminate a disease 
for all successive generations, resulting in 
an enormous payoff both financially and in 
improved health. Here, we discuss the value 
of genomic approaches to support disease 
eradication efforts, particularly by analogy 
with how conservation genomics is support- 
ing efforts to prevent extinctions. 

Although there are fungal, viral, and bac- 
terial NTDs, most are caused by infections 
with parasites (particularly protozoa and 
helminths), including the protozoa that 
cause sleeping sickness, leishmaniasis, and 
Chagas disease, or worms that cause diseases 
such as schistosomiasis, hookworm disease, 
and river blindness. Only two diseases—hu- 
man smallpox and the cattle disease rin- 
derpest—have been successfully eradicated; 
both of these campaigns relied on the avail- 
ability of highly effective and long-lasting 
vaccines that are not available for any of the 
parasitic NTDs. Control methods vary be- 
tween the different diseases and can include 
controlling parasite carriers (vectors), better 
sanitation, and improvements to health sys- 
tems to allow faster detection and treatment. 
However, the main approach has been large- 
scale treatment of people at risk of infection, 
whether they actually have the disease or not. 
The size of these interventions is astounding: 
More than 1 billion people were treated for 
NTDs in 2016, with drugs donated by manu- 
facturers. Nonetheless, this represents only 
63% of the treatment needed to fully cover 
the populations at highest risk (1). 

Key to the success of NTD control is mea- 
suring changes in pathogen populations, to 
ensure that interventions are on course to 
achieve elimination goals and to direct additional 
resources to areas of high or stubborn 
prevalence. However, monitoring pathogen 
populations by conventional techniques is 
challenging in many of the environments 
where these diseases flourish, particularly 
as only limited diagnostics are available 
(2). Monitoring will likely become particu- 
larly difficult as prevalence is reduced (3). 
The power of molecular data to understand 
pathogen populations is already widely ap- 
preciated: Genetics can help confirm sus- 
pected cases, discriminate between strains, 
identify new infectious agents, understand 
the evolutionary history of populations, and 
shed light on transmission. For example, in 
the polio eradication campaign, a global net- 
work sequenced part of every virus isolated 
from around 200,000 stool specimens per 
year to identify transmission links and track 
the disappearance of particular genotypes 
(4). Genetic data identified polio outbreaks 
caused by oral vaccine-derived revertant 
strains, highlighting a need to gradually 
withdraw serotypes from the vaccine (5). 

Whereas the most comprehensive genetic 
data are for prokaryotes and viruses, most 
NTDs are caused by eukaryotic parasites that 
at least occasionally reproduce sexually. Most 
viruses and bacteria reproduce clonally, and 
exchange genetic material unpredictably, so 
molecular epidemiology can be restricted to 
tracking the abundance of different lineages 
through space and time. Recombination in 
sexual eukaryotes makes each region of the 
genome an independent genetic marker, so 
genomic data for a small number of individ- 
uals can reveal a great deal about the popula- 
tion those individuals come from. Genomic 
surveillance could thus be a powerful tool 
for tracking changes in the demography or 
abundance of parasite populations. 

In many ways, the study of eradication 
genomics is a mirror image of conservation 
genomics. Although the latter aims to under- 
stand populations to preserve biodiversity 
and ultimately reverse species declines, eradi- 
cation genomics studies the genetics of popu- 
lations that are deliberately being reduced by 
human activity, to enhance our ability to de- 
press the populations further and ultimately 
remove them completely. For some time, 
population genetic data have been used in 
conservation biology to understand changes 
in endangered populations (6), but we expect 
that analysis of eukaryotic pathogen popula- 
tions could result in some important, more 
subtle lessons from conservation genomics. 
For example, that isolated subpopulations are 
harder to conserve implies that eradication 
will be easier if pathogen populations can be 
isolated from one another. In small popula- 
tions, stochastic changes in allele frequencies 
due to random death or survival of individu- 
als can swamp the natural selection of benefi- 
cial genetic variation; the increased impact of 
this genetic drift can make it much harder for 
small populations to adapt to changing en- 
vironments. Furthermore, inbreeding within 
small isolated populations is expected to be 
deleterious (7). Looking for genomic signa- 
tures such as reduced genetic diversity and 
longer sequences of homozygosity due to in- 
breeding may therefore identify populations 
close to elimination (8) (see the figure). 
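
The expected loss of diversity described above can be illustrated with a minimal sketch. It assumes an idealized, randomly mating diploid population of constant effective size (the Wright-Fisher model), in which expected heterozygosity falls by a factor of 1 - 1/(2N) each generation. Many NTD parasites depart from these assumptions, so the numbers are illustrative only. The function names, the genotype coding, and the example values are hypothetical and are not taken from any surveillance program.

```python
def expected_heterozygosity(h0: float, n_eff: int, generations: int) -> float:
    """Expected heterozygosity after drift in an idealized diploid population.

    Assumes constant effective size `n_eff` and random mating (Wright-Fisher),
    so heterozygosity decays by a factor (1 - 1/(2N)) per generation.
    """
    if n_eff <= 0 or generations < 0:
        raise ValueError("n_eff must be positive and generations non-negative")
    return h0 * (1.0 - 1.0 / (2.0 * n_eff)) ** generations


def longest_homozygous_run(genotypes: list[int]) -> int:
    """Length (in sites) of the longest run of consecutive homozygous sites.

    Hypothetical coding: each entry is the count of alternate alleles at a
    site, so 0 and 2 are homozygous and 1 is heterozygous.
    """
    longest = 0
    current = 0
    for genotype in genotypes:
        if genotype == 1:
            current = 0  # a heterozygous site ends the current run
        else:
            current += 1
            longest = max(longest, current)
    return longest


if __name__ == "__main__":
    # Compare a large and a small population over the same number of generations.
    for n_eff in (10_000, 50):
        h = expected_heterozygosity(h0=0.30, n_eff=n_eff, generations=100)
        print(f"N = {n_eff}: expected heterozygosity after 100 generations = {h:.3f}")

    # A toy genotype vector for one individual along one chromosome.
    print(longest_homozygous_run([0, 2, 1, 0, 0, 2, 2, 1]))
```

Under these assumptions the small population loses diversity far faster than the large one, which is the pattern that genomic surveillance would look for in a population approaching elimination.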

One difficulty is that levels of genome-wide 
diversity in species can be strongly affected by 
life history and ancient demographic events, 
as well as recent changes in population size. 
We may thus need a good picture of pathogen 
populations before control measures have an 
impact. Therefore, collecting and conserving 
samples for genetic analysis should be an 
urgent consideration for control programs. 
Guinea worm is the parasite closest to eradi- 
cation; cases have declined from several mil- 
lion per year in the 1980s to 30 in 2017 (9). 
However, Guinea worm infections were re- 
ported in Chad in 2011 after a decade with 
no reported cases. Coincident with this re- 
emergence were infections in dogs, although 
Guinea worm was not previously considered 
zoonotic (10). We assume that Guinea worm 
evolved during the 10-year gap, but the lack 
of suitable samples predating 2011 makes 
it impossible to exclude the possibility that 
worms migrated from elsewhere during this 
time. Understanding the dog infections is key 
to the eradication program: Although there 
were only 25 human cases reported world- 
wide in 2016, more than 1000 dog infections 
were identified in Chad that year. The need to 
retrospectively obtain samples is clear. There 
are no historical Guinea worm samples stored 
in a way that is likely to keep DNA intact, but 
there may be ways to circumvent the need for 
well-preserved DNA. Recent developments in 
paleogenomic methods, including improve- 
ments in the efficiency of building genomic 
libraries and the use of enzymatic treatments 
to reduce the impact of DNA damage, allow 
access to the highly fragmented and chemi- 
cally modified genomic information recov- 
ered from archival specimens, and even from 
environmental sources such as archaeologi- 
cal specimens or sediments (11, 12). A precise 
picture of the historical population structure 
and size of these parasites will allow us to 
parameterize models of inbreeding and gene 
flow, improving the accuracy of inferences 
about the effect of control measures on current 
populations (13). 

For many species, researchers have started 
exploring the genetics of populations using 
DNA “fingerprints” obtained from the differential 
distribution of microsatellites (repeat 
sequences in DNA) and other molecular 
markers. However, reductions in cost make 
whole-genome sequencing the most appro- 
priate tool for the future, as genome-wide 
data are more powerful, and more compa- 
rable between populations and pathogens, 
and sequencing avoids the need to develop 
marker panels. At least for most key NTD 
pathogens, basic genomic resources such as 
annotated reference genomes are already 
available. Genome-wide data also enable the 
impact of control measures on pathogen pop- 
ulations to be understood beyond changes in 
distribution or numbers, as genomic data are 
sensitive to changes at functional loci and 
could reveal evolutionary changes of epide- 
miological relevance, such as the emergence 
of drug resistance. The emergence or sudden 
spread of new pathogen genotypes, signatures 
of strong recent selection on parts of 
the genome, or signs of a strong bottleneck 
in the absence of reduced prevalence are 
indications that a population is evolving in 
response to drug treatment or other control 
measures. This approach is being used 
to track artemisinin resistance in human 
malaria parasites in Asia, where population 
genomics approaches have played a key role 
in identifying the major locus responsible for 
reduced efficacy of the drug and understanding 
the genetic architecture of this trait (14). 

Loss of diversity in a declining population 
As control efforts reduce the size of a population, the chance that an individual is descended from closely 
related parents increases, leading to a decline in genetic diversity and heterozygosity. 

Clearly, the highest priorities for control 
programs focus on delivering effective con- 
trol measures and maintaining funding and 
political support. However, the rewards for 
collecting genomic data, and establishing 
pretreatment baselines through the analysis 
of archival samples, will be increasingly evi- 
dent as programs mature. There will be many 
challenges in establishing genomic surveil- 
lance programs: the need to be comprehen- 
sive and coordinated, obtaining access to 
both prospective and archival samples, un- 
derstanding the epidemiology and biology of 
the pathogens, to name a few. In some cases, 
theory will need to be developed to explain 
the population genetics of NTD pathogens, 
many of which have genetic systems distinct 
from those of obligately sexual, diploid ani- 
mals. Translation of biomedical science to 
clinical practice has necessitated “bench to 
bedside” collaborations, and similarly, eradi- 
cation genomics will require collaboration 
between technical experts in molecular bi- 
ology and paleogenomics, as well as groups 
with expertise in the biology, control, and 
epidemiology of particular NTDs. The speed 
and scope of changes in NTD pathogen pop- 
ulations happening today is new territory 
for science. The ambitious goals set for NTD 
control will require pathogen populations to 
shrink at an unprecedented rate. However, 
success is not guaranteed (15) and will 
require constant vigilance. 

INSIGHTS | PERSPECTIVES 


FERROELECTRICITY 


Perovskite ferroelectrics go metal free 


Metal-free perovskites exhibit ferroelectric properties rivaling those of BaTiO₃ 


By Wei Li and Li-Jun Ji 


Ferroelectric materials, which have spontaneous electric polarization that can be
switched with an external electric field, have applications as capacitors, sensors,
and data-storage devices (1). Since the discovery of high-performance BaTiO₃ and
LiNbO₃, ferroelectric perovskite oxides have dominated industrial applications (2).
However, these ceramics are expensive to produce as high-quality thin films, and
they often contain heavy metals such as Pb²⁺, which raises health and environmental
concerns. Replacing heavy metals with organic components in the perovskite lattice was
considered an alternative (3), but hybrid organic-inorganic perovskites have not shown
comparable ferroelectricity and stability (4). On page 151 of this issue, Ye et al. (5)
report metal-free perovskites that can compete with ferroelectric oxides (see the figure).

Perovskites have a general formula ABX₃, where A and B are large and small metal
cations, respectively, and X is a bitopic anion (for example, O²⁻) that coordinates to B.
The structural topology can be maintained if the A- and X-site ions are replaced with
organic molecules within certain size ranges (6). The obvious starting point would be
substituting the large A-site metal cation, and the successful structural
characterizations of MAPbX₃ (where MA is methylammonium, and X is Cl, Br, or I) opened
up such possibilities (7).

Many hybrid organic-inorganic perovskites have been synthesized by replacing the single
atomic X site with bitopic organic linkers such as formate and dicyanamide groups (8).
Can this strategy extend to the B site? The perovskite framework is constructed via
coordination bonding in oxides and hybrid systems. Formation of coordination bonds
requires that a ligand donate lone-pair electrons to the orbitals of transition metals.
Likewise, for forming hydrogen (H) bonds, electronegative atoms with lone-pair electrons
are needed as acceptors. In this sense, these two different bonding interactions
exhibit some geometrical similarity. As the B site requires spherical symmetry to bond
with six neighboring X sites for forming an octahedron, replacing it with a molecular
cation that requires an octahedral H-bonding mode to bind with appropriate X-site
acceptors could result in metal-free perovskite compounds. Indeed, the first metal-free
perovskite, (piperazinium)(NH₄Cl₃)·H₂O, was synthesized by assembling the NH₄⁺ with
chlorides through H-bond interactions (9).

Guided by these molecular design principles, Ye et al. have designed a new class of
ferroelectric metal-free perovskites. The inclusion of noncentrosymmetric or chiral
organic amine cations imposed the polar symmetry that rendered the desired ferroelectric
ordering in the perovskite lattice. The prototypical compound MDABCO-NH₄I₃ (MDABCO,
bis-N-methyl-N′-diazabicyclo[2.2.2]octonium) exhibits a high Curie temperature (Tc of
448 K) and substantial spontaneous polarization (Ps of 22 μC/cm²), which compare well with
the properties of commercial BaTiO₃. Such notable Tc and Ps values of MDABCO-NH₄I₃
should enable wide potential applications in environments where conventional oxide
materials dominate. In addition, their multiple polarization directions enable facile
rotation through electrical-field manipulation, which should make the poling process
easier (10).

Ferroelectric origins in perovskites
Ferroelectricity in perovskites is induced by discrete structural changes that occur at phase transitions.

Perovskite oxides
Ferroelectricity primarily arises from the displacement of the Ti⁴⁺ ion from the center
of the TiO₆ octahedron in BaTiO₃.

Metal-free perovskites
Ferroelectricity in the materials of Ye et al. primarily stems from the order-disorder of
the polar diamine cation.
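
The size constraint on the A-, B-, and X-site ions mentioned earlier is commonly expressed through the Goldschmidt tolerance factor, t = (r_A + r_X) / (√2 (r_B + r_X)), with values near 1 generally associated with the ideal cubic perovskite arrangement. The sketch below evaluates it for BaTiO₃ as a point of comparison. The ionic radii are approximate tabulated values supplied here as assumptions, and the function name is hypothetical. For molecular ions such as those in the metal-free perovskites, effective radii would have to be estimated separately, which this sketch does not attempt.

```python
import math


def tolerance_factor(r_a: float, r_b: float, r_x: float) -> float:
    """Goldschmidt tolerance factor for an ABX3 perovskite.

    All radii must be in the same unit (here angstroms).
    """
    if min(r_a, r_b, r_x) <= 0:
        raise ValueError("ionic radii must be positive")
    return (r_a + r_x) / (math.sqrt(2.0) * (r_b + r_x))


# Approximate ionic radii in angstroms (assumed values, for illustration only):
R_BA = 1.61   # Ba2+, 12-coordinate
R_TI = 0.605  # Ti4+, 6-coordinate
R_O = 1.40    # O2-

t_batio3 = tolerance_factor(R_BA, R_TI, R_O)

if __name__ == "__main__":
    print(f"Tolerance factor for BaTiO3 (assumed radii): {t_batio3:.2f}")
```

A value slightly above 1 for BaTiO₃ indicates that the B-site cation is somewhat small for its octahedral cage, which is consistent with the off-center Ti⁴⁺ displacement described in the figure.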

The substantial organic content of the 
metal-free perovskites makes these com- 
pounds soft, so they respond to stress dif- 
ferently compared with oxides and hybrid 
perovskites, in which stronger ionic and coordination 
bonding forces exist. This softness enables 
facile growth of high-quality thin films 
that are critical for managing electric dipoles 
through strain engineering. These new metal- 
free perovskite ferroelectrics are simple to 
synthesize, low cost, and lightweight. 

The marriage between symmetry and H 
bonding in these metal-free compounds ex- 
tends the fields of perovskites and ferroelec- 
trics to a new realm. However, some exciting 
aspects need further exploration. Bonding interactions 
between NH₄⁺ and halides extend 
beyond H bonding, and their electronic na- 
ture needs to be further unveiled. Some of the 
transitions in these metal-free perovskites are 
strongly first-order and ferroelastic, which 
leads to substantial shear strains and distinctive 
twin-wall dynamics (11). Ferroelastic 
twin walls show exotic physical properties 
beyond the bulk material (12), so knowledge 
of the twin-wall nature in these metal-free 
perovskites would enrich their potential for 
high-density memory devices (13). Finally, 
the existence of chirality in these molecular 
perovskites complicates the alignment of 
electric dipoles in their crystal lattices, so 
understanding its coupling with ferroelectric 
ordering could lead to unexpected properties 
at the bulk and nanoscale levels (14). 

RETROSPECTIVE 


Jens Christian Skou (1918-2018) 


A pioneer in the biochemistry of membrane proteins 


By Poul Nissen 


Jens Christian Skou, a pioneer of biomembrane research and molecular 
physiology, died on 28 May, just a few months shy of his 100th birthday. 
The Danish physiologist discovered the enzyme known as the 
sodium-potassium adenosine triphosphatase (Na⁺,K⁺-ATPase), or sodium-potassium 
pump, which exports sodium ions from 
and imports potassium ions into cells. Al- 
most all modern biochemistry, cell biology, 
and physiology textbooks showcase the 
sodium-potassium pump, which accounts 
for about 20 to 30% of the energy used by 
cells through hydrolysis of ATP in human 
and animal bodies and up to 70 to 80% in 
the brain. In recognition of this first dis- 
covery of an ion-transporting ATPase, Skou 
shared the 1997 Nobel Prize in Chemistry 
with chemists Paul D. Boyer and John E. 
Walker for their work on the mechanism of 
ATP synthesis. 

Skou grew up in the coastal town of Lem- 
vig at Limfjorden in Northern Jutland, Den- 
mark. He earned his medical degree from 
the University of Copenhagen in 1944, and 
before any consideration of a research career 
had even struck him, he practiced medicine 
at a small hospital in the town of Hjørring, 
also in Northern Jutland. In 1947, to help his 
career in medicine along by earning a doc- 
torate degree, Skou moved to the then very 
young Aarhus University to do research in its 
small department of physiology. His work fo- 
cused on the mechanism of anesthetics. This 
brought him to lipid monolayers as models 
of the nerve cell membrane, and then to a 
research visit to the marine biology research 
station in Woods Hole, Massachusetts, in 
1953. The trip was a transformative moment 
of his career. Unlike his small Aarhus depart- 
ment, Woods Hole was a large and dedicated 
research environment with intense scholar 
interactions and a huge library of scientific 
journals and books. There, Skou came across 
the curious observations of an ATPase activ- 
ity in the nerves of giant squid. He took these 
studies back home to Aarhus and applied 
them to a more manageable source of nerves 
from local crabs, which he found displayed 
the same activity. Leaving aside his medical 
career plans, he instead devoted himself to 
research. Skou and a slowly growing research 
community characterized the enzyme, guided 
by the excellent controls of tractable sub- 
strates and specific inhibitors, and through- 
out his career, this system would define his 
work. He remained at Aarhus University and 
continued to do research even after his retire- 
ment in 1988. 

In 1957, Skou published his first paper 
on the Na⁺,K⁺-ATPase enzyme. In it, he 
speculated that the enzyme might be the 
long-sought sodium-potassium pump that 
maintains steep electrochemical gradients 
for Na⁺ and K⁺ across the cell membrane 
and enables the majority of transmembrane 
transport and signaling processes, includ- 
ing the action potentials of firing neurons. 
He proposed that the pump should span 
the cell membrane, an idea that at the time 
was heavily disputed. Despite insight just 
gained from the myoglobin structure that 
made the hypothesis seem unlikely, Skou 
persisted, and later in the 1960s and early 
1970s, when better models of biomem- 
branes were presented and the first struc- 
ture of a membrane protein was visualized, 
his theory was shown to be true. 

Aarhus University came of age, and in the 
mid-1960s when physiologist Peter Leth Jørgensen 
joined the department, a highly productive 
research community emerged with 
a critical spark of creativity and drive. Skou 
and Jørgensen’s internal competition and 
fierce discussions were well-known; neither 
of them would prioritize compromises and 
“hygge” (a Danish social concept of relaxed 
tolerance, comfort, and contentment) over 
scientific rigor. Whereas Skou was the classical 
biochemist, Jørgensen became a molecular 
biologist. The two groups complemented 
each other in both style and results. 

I first met Skou in 1991 when I was a 
master’s student in Aarhus University’s de- 
partment of chemistry. In Jens Nyborg’s lab, 
we were working on the EF-Tu protein and 
transfer RNA by crystallography. Skou came 
to request the use of our French press homog- 
enizer to disrupt cell membranes and isolate 
Na⁺,K⁺-ATPase from shark rectal gland. I 
used the device often, so people directed 
him to me. Crystallization of the sodium- 
potassium pump seemed an unreal possibil- 
ity at that time, but the encounter led to our 
first discussions about how to proceed. 

It wasn’t until 2007 that I talked again at 
length with Skou. Our group, which included 
biophysicist Jesper Vuust Møller, had determined 
the Ca²⁺-ATPase crystal structure and 
then (with physiologist Bente Vilsen, also at 
Aarhus) the first crystal structure of Na⁺,K⁺-ATPase. 
Skou had invited himself to see it. I 
was still a newcomer to the Na⁺,K⁺-ATPase 
community, and I was quite nervous about 
hosting this giant of the field. Fortunately, 
Skou set me at ease by drawing me into a 
chat about sea trout fishing. 

I learned that Skou was a man of the sea. 
He grew up by the shores of Limfjorden, and 
his research ideas had materialized at Woods 
Hole. His favorite sources for Na⁺,K⁺-ATPase 
studies came from crab and shark, and local 
fishermen were his most trusted suppliers. A 
devoted fly fisher of sea trout, he spent time 
on the shores of the beautiful peninsula of 
Mols across the Bay of Aarhus, where he had 
a perfectly located summer cottage. 

People would often describe Skou as stub- 
born, but he was a modest man and a clear 
supporter of the social equality that emerged 
in Denmark in the 20th century. His wife, 
Ellen Margrethe, was active in politics and 
a respected voice on the public health care 
system. He was also a strong proponent of 
work-family balance and would always finish 
work at the lab in the afternoon to join his 
family. His Nobel Prize gave him the author- 
ity to advocate giving researchers the free- 
dom he had been given in his early career to 
pursue creative ideas and follow his curios- 
ity. He wished that today’s younger genera- 
tions could have the same leeway to explore 
that had allowed him to develop his original 
discoveries. Jens Christian Skou personified 
the creativity and courage of a great scientist, 
and he was a quiet and modest role model to 
us all. He will live on in our research, discussions, 
and memories. 

INSIGHTS 


POLICY FORUM 


CITIZEN SCIENCE 


Citizen science, public policy 


New research models may benefit from policy modifications 


By Christi J. Guerrini, Mary A. Majumder, 
Meaganne J. Lewellyn, Amy L. McGuire 


Citizen science initiatives that support 
collaborations between researchers 
and the public are flourishing. As a result 
of this enhanced role of the public, 
citizen science demonstrates more diversity 
and flexibility than traditional 
science and can encompass efforts that have 
no institutional affiliation, are funded en- 
tirely by participants, or continuously or sud- 
denly change their scientific aims. But these 
structural differences have regulatory impli- 
cations that could undermine the integrity, 
safety, or participatory goals of particular cit- 
izen science projects. Thus far, citizen science 
appears to be addressing regulatory gaps 
and mismatches through voluntary actions 
of thoughtful and well-intentioned practitio- 
ners. But as citizen science continues to surge 
in popularity and increasingly engage diver- 
gent interests, vulnerable populations, and 
sensitive data, it is important to consider the 
long-term effectiveness of these private ac- 
tions and whether public policies should be 
adjusted to complement or improve on them. 
Here, we focus on three policy domains that 
are relevant to most citizen science projects: 
intellectual property (IP), scientific integrity, 
and participant protections. 

Although the definitional bounds of citizen 
science are debated, there is general consensus 
that citizen science encompasses scientific en- 
deavors in which individuals without specific 
scientific training participate as volunteers in 
one or more activities relevant to the research 
process other than (or in addition to) allow- 
ing personal data or specimens to be collected 
from them. These activities might take place 
at any point during the research process and 
include participation in study design, data 
collection and analysis, and dissemination of 
results (1). They might even encompass the 
entirety of the research process where, as in 
coordinated self-experimentation, the role of 
professional scientists is minimal. 

Recognizing the potential for citizen science 
to advance scientific knowledge and 
promote public support of scientific activities, 
professional associations have emerged 
in the United States, Europe, and Australia to 
support citizen science efforts, including con- 
sideration of policy interactions. Meanwhile, 
the U.S. government recently passed legisla- 
tion that supports agency use of citizen sci- 
ence and crowdsourcing to conduct projects 
that advance their missions (2). 


INTELLECTUAL PROPERTY 

Citizen scientists are usually not paid for 
their contributions, although they might ben- 
efit from participation in other ways. When 
citizen scientist volunteers are not com- 
pensated, their work is not subject to most 
employment-related laws and practices that 
govern their scientific collaborators. Some 
employment-related laws and practices have 
implications for ownership of IP, including 
copyrights and patents. Although domestic IP 
laws generally do not reach beyond national 
borders, international treaties have harmo- 
nized basic IP protections. Thus, application 
of many U.S. protections to citizen science 
contexts is generalizable to other countries, 
though some important differences persist. 

U.S. copyright protection extends to au- 
thors of every original work fixed in a tan- 
gible medium. An exception exists for a 
“work made for hire” that is prepared by an 
employee within the scope of employment or 
by agreement as a specially ordered contribu- 
tion to a collective work (3). Because citizen 
scientists are volunteers, any copyrightable 
works that they develop in the course of a 
project—for example, photographs, writings, 
and creative selections or arrangements of 
scientific data—likely do not constitute works 
for hire absent valid agreements otherwise. 
Rather, the volunteer who created the works 
is the copyright “author” and owner. Other 
countries do not recognize an automatic 
transfer of ownership to employers or com- 
missioning parties (3). 

Where projects do not require copyright 
assignment as a condition of volunteering, 
a citizen scientist who retains ownership of 
his or her works under governing domestic 
law can refuse to grant permission to publish 
them, which could disrupt the scientific pro- 
cess or prevent the dissemination of findings. 
However, projects can and often do avoid 
potential disputes using Creative Commons 
licenses, which allow creators to retain their 
copyrights while permitting others to copy 
and use their works. 

A similar tool is not widely available for 
patents. In most countries, the legal inven- 
tor of a patentable discovery is one who con- 
tributes to its conception, and in the United 
States, only inventors may apply for patents. 
Employers typically require their scientist 
employees to assign any rights to future in- 
ventions as a condition of employment, in- 
cluding decisions to apply for patents and 
license inventions. By contrast, patent assign- 
ments do not appear to be a typical condition 
of volunteering in citizen science projects. 
Yet, it is possible that some citizen scientists 
will make contributions that support claims 
of sole or co-inventorship under domestic 
patent laws (4). For example, Sharon Terry, 
a self-described citizen scientist who helped 
discover the gene responsible for her chil- 
dren’s rare disease, pseudoxanthoma elasti- 
cum, is a co-inventor on U.S. patents related 
to that discovery (5). Challenges may arise if 
the citizen science inventor who has retained 
IP rights exercises them to exclude projects 
from using the patented inventions or dis- 
agrees with co-inventors on whether to pur- 
sue patents or license inventions to others. 

Contracts can be used to clarify rights 
and establish expectations related to pat- 
ent and other IP rights. Projects might take 
a page from employment practices and 
require participants to assign future pat- 
ent rights to project leaders as a condition 
of participation. However, such a practice 
seems incompatible with models that view 
citizen scientists as respected partners in the 
research process. The online citizen science 
game FoldIt, for example, does not require 
patent assignments. Instead, FoldIt’s IP pol- 
icy provides that “players who contributed to 
the discovery will be considered co-inventors 
for any discovery produced through play” 
and issues of ownership will be handled at 
a later date by the University of Washington, 
where the game was developed and is man- 
aged (6). It remains to be seen how well this 
approach will work in practice (7). 

Alternatively, contracts can incorporate 
advance commitments by citizen scientists 
and professional scientists to cede steward- 
ship of patents to nonprofit organizations 
that support research. Ms. Terry’s advocacy 
organization, for example, is the assignee of 
the patented inventions that she helped dis- 
cover (5). However, it is unclear whether this 
model would be readily accepted in contexts 
where citizen scientists do not have funding 
or other leverage to negotiate IP control. 

Citizen scientists and professional scientists
might forgo patenting discoveries
resulting from their collaborations or grant
each other nonexclusive, royalty-free cross-licenses
to any discoveries. Although the development
of standardized agreements for each
of these scenarios seems achievable, support 
for such agreements may be limited where in- 
terest in commercialization is high (7). One- 
way material transfer agreements, which are 
a common vehicle for transferring research 
materials between institutions, might be 
adapted to promote sharing by and with citi- 
zen scientists. But in some cases, they may 
add unnecessary complexity to what might 
otherwise be straightforward transactions, 
and they are notoriously difficult to monitor 
and enforce. 

In sum, although contractual approaches
have limits, the ability to tailor contracts
to circumstances makes them more practical
than legislative approaches that would
change the legal rules to produce different
default outcomes, assuming that consensus
on optimal outcomes could even be reached.
As a policy matter, resources are probably
better spent encouraging transparency and
negotiation of IP terms at the outset of citizen
science collaborations.

Citizen scientists collect samples in a U.S. national park.


RESEARCH INTEGRITY 

Concerns about the quality of data contrib- 
uted by citizen scientists and the soundness 
of their collection, reporting, and analyti- 
cal techniques have long been raised (8, 
9). Although professional scientists are not 
immune from quality transgressions, meth- 
odological rigor is central to their training 
and professional advancement, and formal 
mechanisms exist for holding professional 
scientists accountable for the quality of 
their work. Volunteers, on the other hand,
may not experience similar external pressures
to ensure research integrity (10) and
may prioritize other aspects of their participation.
Also, there may be fewer opportunities
for professional scientists to
address knowledge gaps and otherwise act 
as a check on quality if their involvement in 
projects is minimal. 

Conflicts of interest can also undermine 
research integrity. Individuals who engage 
in citizen science can have biases stemming 
from their alliances with private, nonprofit, 
and political organizations, as well as their 
involvement in lawsuits (9, 10). They may 
also have biases based on their perceptions 
of how they or their community might be 
harmed or benefited by particular findings. 
Further, and especially with respect to re- 
search on politically charged topics, it is not 
beyond the realm of possibility that citizen 
scientists might be recruited to provide bad 
data to “improve” results or even sabotage research,
although careful monitoring will help
ferret out such attacks.

Of course, research conducted exclusively 
by professional scientists can also be un- 
dermined by personal biases and conflicts 
of interest, but institutional rules, funding 
stipulations, regulatory procedures, and pro- 
fessional norms work to compel their identi- 
fication and disclosure whether the research 
is conducted in for-profit, nonprofit, or gov- 
ernmental settings. Citizen science projects 
that are not embedded within institutions, 
are self-funded, or are otherwise outside of 
regulatory control may operate without these 
traditional safeguards. 

Citizen science projects have adopted 
strategies to promote research integrity, and 
many guides, tools, and templates are avail- 
able to support projects from the planning 
stages through evaluation (11, 12). Studies
indicate that citizen scientists can produce
reliable data on par with those produced by
professionals (12). But it is not yet clear how
well the best practices that have been de- 
veloped address the many flavors of citizen 
science or are being followed by its practitio- 
ners. These questions should be studied, but 
in the meantime, there may be opportunities 
to promote integrity through policy. 

For example, recently enacted U.S. legisla- 
tion aimed at promoting citizen science and 
crowdsourcing projects by federal agencies 
provides that all data collected through such 
efforts should be made publicly available 
where appropriate and to the extent possible 
(2). Such data accessibility creates opportuni- 
ties to investigate questionable or poor-qual- 
ity data and assess fitness for use through 
independent examination (10). The law also 
requires agencies to “make all practicable ef- 
forts” to ensure that participants adhere to 
federal research misconduct policies, which 
include sanctions for fabricating and falsi- 
fying data (2). For studies conducted with 
federal funds, the penalties for research mis- 
conduct—the most severe being a funding 
ban—will not apply to most citizen scientists 
because they do not seek such funding. How- 
ever, their interests will be affected if profes- 
sional collaborators lose the federal support 
that makes possible their work. 

These regulations will not reach citizen sci- 
ence projects that are not conducted with U.S. 
federal employees or not federally funded. 
Journals that publish citizen science research 
help fill this gap through the peer review 
process and by placing pressure on authors 
to make their study data publicly available. 
Although these policies do not affect citizen 
science projects that are uninterested in 
traditional publication, some projects may 
have commitments to openness that match 
or even exceed journal requirements. For ex- 
ample, DIYgenomics.org publishes protocols 
and data on project wikis. 

A related problem is presented when citi- 
zen scientists have conflicts of interest that 
are not disclosed. Many national regulations 
and journals have adopted requirements of 
conflicts reporting, but they may not reach 
citizen scientists who do not qualify as inves- 
tigators or authors, although they facilitate 
research in meaningful ways. To promote 
accountability, such requirements should 
extend to the disclosure of relevant conflicts 
held by citizen scientists. Problems will arise, 
however, when disclosure is incompatible 
with volunteer terms that allow anonymous 
participation. We support project discretion 
to offer anonymity, especially where sensitive 
information is involved or citizen scientists 
could be subjected to intimidation or harm 
if identified. In such cases, consideration 
should be given to permitting disclosure at 
the aggregate level (10).


PARTICIPANT PROTECTIONS

As uncompensated volunteers, citizen scien- 
tists are unable to rely on traditional labor 
laws to prevent and redress harm because 
they do not qualify as protected workers. The 
Common Rule, which applies to all research 
involving human subjects that is conducted 
or supported by the U.S. government, is di- 
rected at protecting research participants. 
For studies governed by the Common Rule 
and not within an exemption, institutional 
review boards (IRBs) must ensure that any 
risks to subjects are minimized and reason- 
able in relation to anticipated benefits. Stud- 
ies that take place in other countries may 
still be subject to the Common Rule or simi- 
lar national requirements, so consideration 
of the Common Rule’s provisions may be 
broadly instructive. 

Importantly, Common Rule protections
are directed toward “subjects” of research
from whom identifiable data or biospecimens
are collected. They do not explicitly
authorize IRBs to consider risks or benefits
to citizen scientists who facilitate research in
other ways. Depending on the setting and
study design, examples of specific risks to
citizen scientists could include expectations
of overwork, requirements to assume financial
burdens, or vulnerability to harassment
by others. Examples of specific benefits, on
the other hand, could include training in
scientific techniques, access to tools and
data for personal use, or opportunities for
coauthorship. Where citizen scientists are
both subjects and facilitators of research,
IRBs may interpret their jurisdiction to include
risks and benefits associated with both
roles. However, in some contexts, citizen
scientists are only research facilitators and
not also subjects, or their roles may be ambiguous
(13). Federal protections also do not
explicitly contemplate risks and benefits to
communities, although community-based
research review processes, including community
IRBs and advisory committees, have
emerged to complement traditional IRB review
to ensure that involved communities
are engaged in and directly benefit from proposed
research and that study designs are
culturally appropriate.

Many U.S.-led citizen science initiatives
are not covered by the Common Rule because
they are not federally conducted or
supported (14). A small number may nevertheless
be covered by research subject protections
adopted by the U.S. Food and Drug
Administration (FDA), which apply to clinical
investigations that support applications
for products regulated by FDA, regardless
of funding source. Like the Common Rule,
however, FDA research subject protections
do not explicitly account for risks and benefits
associated with supporting scientific
initiatives in ways other than serving as research
subjects.

For those citizen science initiatives outside
the scope of these federal regulations
and not conducted in the few states that
extend these kinds of protections to all research
regardless of funding source, there
may be no legal requirement to evaluate or
disclose potential risks to citizen scientists.
Individuals owe a duty of ordinary care to
avoid foreseeable injury to others, including
general volunteers, but application of
this common law doctrine is highly context-specific
and may be unavailable to volunteers
for any number of reasons. Moreover,
some projects may require waivers of liability
as a condition of volunteering, which
are not allowed in the context of federally
regulated research.

Some unregulated projects, like Sage
Bionetworks’ mPower study of Parkinson’s
disease, which collects data through
participants’ mobile phones, voluntarily
close this gap by securing independent IRB
review of their protocols, although
this can be cost-prohibitive for many projects.
Others have proposed ethics evaluations by 
“citizen ethicists” who critique experiments 
and post their opinions online for potential 
participants to review, as well as download- 
able ethics toolkits geared toward citizen sci- 
entists (15). Meanwhile, do-it-yourself biology 
projects may voluntarily adhere to a commu- 
nity code of ethics, which includes directives 
to pursue peaceful purposes and adopt safe 
practices. Until recently, the community also 
supported an online forum through which 
professional biosafety experts answered 
questions posed by citizen scientists. How- 
ever, some projects may fail to engage in any 
kind of risk assessment or management. 

Of the three policy domains discussed, 
participant protections require the most 
immediate attention given that they impli- 
cate direct physical harms, yet collective 
action focused on assessing those harms 
relative to other risks and benefits specific 
to citizen science has thus far been limited. 
How citizen science projects are making 
these risk-benefit calculations, and accord- 
ing to what processes, should be empirically 
investigated. Then, whether these data jus- 
tify extending current legal protections to 
unregulated citizen science, or creating new 
policy frameworks altogether, merits study 
by policy-makers in partnership with the 
citizen science community.
If it is determined that additional regulation
is warranted, it is unlikely to occur at the
federal level given that a proposal to extend 
the Common Rule to additional kinds of re- 
search was recently considered and rejected. 
At a minimum, however, guidance should be 
developed by citizen science practitioners 
and participants in collaboration with ethi- 
cists to aid unregulated projects that cannot 
afford independent IRB review in conducting 
risk-benefit assessments. Such guidance also 
may be useful to traditional IRBs evaluating 
regulated projects in which the role of citi- 
zen scientists is multi-faceted or ambiguous. 
In addition, modes of risk assessment and 
management that are complementary or al- 
ternative to those required by the Common 
Rule and FDA regulations, including but not 
limited to ethics reviews conducted by non- 
traditional IRBs and citizen ethicists, should 
be evaluated to understand the citizen sci- 
ence contexts in which they could be useful. 

As citizen science becomes more prevalent, 
it is increasingly important that policy oppor- 
tunities to support participants and practitio- 
ners are identified and responsibly pursued. 
In the end, some policy adjustments may 
be in the best interests of society, but to be 
successful, they must appreciate the distinct 
ethos of citizen science and be guided by its 
diverse stakeholders.


To be a bee


A charming account 
celebrates the insects’ 
idiosyncrasies and the 
people passionate 
about protecting them 


By Rachael Winfree 


We have bees to thank for some of
the better features of our world,
from the fruits and vegetables we
eat to the world’s flowering plants,
which radiated alongside them
during the Cretaceous. Nearly 90%
of plant species require pollinators (an added
benefit for anyone who suffers from wind-borne
pollen allergies). Humans even seem to
share aesthetic preferences with bees, insofar
as the shapes, colors, and scents of the flowers
we enjoy originally evolved to appeal to
these and other pollinators.

Given the necessity of bees, the signs that
they are in trouble are concerning. Although
maladies of the domesticated
honey bee (Apis mellifera), such as
colony collapse disorder, have captured
the most public attention,
declines in the world’s 20,000
wild bee species have the potential
to be even more ecologically catastrophic.
In Buzz, Thor Hanson
deftly presents the predicament
of bees today, in part by recounting
conversations with scientists
and practitioners who are leading
the effort to understand and conserve
bees. In only 304 pages, he
successfully covers a wide range
of topics, from bee evolution and natural history
to the economic value of pollination.

Like all good naturalists, Hanson has a gift
for seeing the world from his subject’s point
of view. At one point, he predicts the nest site
preferences of a bumblebee queen (a boot, it
turns out). At another, he empathizes with
bees that won’t pollinate alfalfa because the
flowers’ anthers bop them on the head.

In addition to his interest in entomology,
Hanson also has an interest in etymology.
Who knew, for example, that “dumbledore”
is a historical name for the bumblebee? The
book’s bibliography reflects the diversity of
topics covered, ranging from Virgil and Tolstoy
to the scientific literature and CNN.

In one of the more surprising chapters
(even to an ecologist who studies bees), Hanson
presents evidence for the key role played
by honey bees in human evolution. He begins
with the honeyguide, an African bird species
that feeds on bee hives but needs
another, larger species to conduct
the initial raid on the hive. The
honey badger was long thought to
be the honeyguide’s original mutualist
partner. Bringing some basic
biology to bear, Hanson points
out that honey badgers don’t like
to climb trees and in addition
are nearsighted, hard of hearing,
and nocturnal. By contrast, the
Hadza people of Tanzania eagerly
seek out honeyguides and harvest
honey from wild honey bee
hives, so much so that honey constitutes
15% of the calories in the Hadza diet.
The availability of honey may even have furthered
the evolution of our unusually large
brains, the book reveals.

The sweat bee Augochloropsis sumptuosa is one of the roughly 20,000 known bee species.

Buzz: The Nature and Necessity of Bees
Thor Hanson
304 pp.

Hanson is an upbeat and often humor- 
ous guide, with a DIY attitude, the latter of 
which sometimes lands him in odd situa- 
tions (such as sitting in a McDonald’s, us- 
ing tweezers to separate the bee-pollinated 
from the non-bee-pollinated components 
of a Big Mac). In another chapter, after 
reading that branched, pollen-collecting 
hairs are a distinctive trait that separates 
bees from their evolutionary forebears the 
wasps, he tests the hypothesis that minute 
particles adhere better to bees than wasps
by weighing a bee and a wasp before and
after dipping each in flour. 

Hanson also has a sharp eye for the id- 
iosyncratic—which he doesn’t have to look 
far to find among bees or among bee en- 
thusiasts. In the first category, he describes 
bees that nest inside active termite mounds 
and others that construct “elaborate vases” 
made of flower petals. Still others carry 
pollen on their abdomens, making each 
bee “look like she’s wearing a tiny, brightly 
colored apron” (as compared with most 
other bees, “whose pollen loads look more 
like stockings pulled up high on their back 
legs”). In the second are a retired insurance
salesman who gives bees away as presents 
and an alfalfa farmer who slows his car to 
a crawl while driving so as to avoid killing 
even a single bee. 

The only criticism I have of Buzz, and 
it is not a major one, is that the book un- 
derplays how little we actually know about 
bees. The chapter covering the decline and 
possible extinction of certain North Ameri- 
can bumblebee (Bombus) species quite 
rightly focuses on the extent and causes of 
these declines. It doesn’t mention, though, 
that Bombus is among the best-studied of 
the world’s 443 bee genera. The conserva- 
tion status of most bee species is simply un- 
known. In the United States, for example, 
there is no federal list of rare or declining 
bee species, and ad hoc assessment of which 
species are in trouble has been left to re- 
searchers and the Xerces Society for Inver- 
tebrate Conservation. 

Overall, however, if you have time to read
one book on what is happening with modern
bees, you couldn’t do better than Buzz.


Learning from different disciplines


Conversations spark connections as scientists search for inspiration in other fields 


By Brian Uzzi 


If you knocked on Heaven’s door, and God
greeted you, what question would you 
ask? What is the nature of human con- 
sciousness, and how can it be expanded? 
Where does the Universe begin and end? 
What is time, and why isn’t it 
constant? What causes deviance? 
And further still, if you received 
an answer, could you decipher 
it? Or would the answer only be 
a clue leading to the next clue? 
The Most Unknown, a documentary
film in the Simons
Foundation Science Sandbox series, takes
the viewer on a fantastical journey of nine
scientists as they intrepidly knock on Heaven’s
door. Each asks a profound question in a
different way. But all are making a difference
in how science explains the Universe, the
world we live in, and the worlds within us.

Although the filmmaker, Ian Cheney, is
not a scientist, he makes a point that is now
well documented scientifically: Scientists
from different disciplines who immerse
themselves in one another’s work and exchange
ideas solve the hardest problems
(1). Midway through the film, for example,
California Institute of Technology geobiologist
Victoria Orphan describes how tiny
microbes, too small to see with the naked
eye, can “eat” greenhouse gases and therefore
are a potential source of big solutions
for climate change. Orphan then travels to
Boulder, Colorado, where she meets physicist
Jun Ye, who is obsessed with measuring
time with a clock so precise
that it only loses one second in
accuracy every million years.
As the two scientists converse,
they make a potential connection:
The secret to the success
of greenhouse-eating microbes
may be their particular experience
with time. At the bottom of the ocean,
strong gravitation waves bend and slow
down time, reveals Ye. The change isn’t a
lot, but it may be a clue to understanding
the microbes’ specific dietary requirements.

The Most Unknown
Ian Cheney, director
Vice/Motherboard, 2018,
88 minutes. Available
on Netflix, August 2018.

Microbiologist Jennifer Macalady collects samples from a water source in the Frasassi caves in Italy.

From Boulder, Ye travels to Sussex, United
Kingdom, where the world’s most accurate 
clock maker comes face to face with neuro- 
scientist Anil Seth, who studies why humans 
can’t keep time without reference points 
of activity. In Seth’s laboratory, subjects in 
magnetic resonance imaging machines are 
asked to guess the runtime of video clips. 
They find that the human perception of time 
is linked to the activity level in the video. 
When persons in a video are highly active 
or objects move quickly, subjects underesti- 
mate the length of the video and vice versa 
for low-activity videos. One potential implication
of his work is that although our life
spans are increasing, social media and frenzied
news cycles may lead to the perception
that life is shorter and less fulfilling.

Published by AAAS

Science is replete with examples of mys- 
teries that were unlocked through the ac- 
cumulation of sometimes small and often 
separate advances that came from varied 
perspectives. The process of discovery is un- 
certain. What looks like a breakthrough may 
prove to be a dead end, whereas accidents 
may provide a result—a piece of a larger 
puzzle—that suddenly makes all the other 
separate pieces make sense collectively. 

On an uninhabited island off the coast of 
Puerto Rico, Seth meets Yale psychologist 
Laurie Santos, who studies deviance—theft, 
to be exact. Santos is specifically interested 
in the conditions under which monkeys in 
the wild risk taking a five-finger discount 
of succulent treats. In a moment of levity, 
an accident messes up the experiment to be 
filmed, but the presence of the documen- 
tary crew reveals something unexpected 
about monkey behavior: Even if the re- 
searcher is looking away, attempted theft is 
less likely to occur when there are potential 
third-party witnesses. A serendipitous mis- 
take gives Santos her next clue. 

Woven into the film’s electrifying tapestry 
of ideas, persons, and places is the story of 
the hard work of science. Great discovery, we 
see, comes part and parcel with emotional 
frustration and disappointment, unusual 
hours that buck circadian rhythms, tight 
spaces, cold water, and dark places. 

In The Most Unknown, Cheney connects 
apparently disparate journeys of discovery, 
illustrating where interdisciplinary team- 
work can fill in a scientist’s blind spots. 
When seen through this lens, science can 
be viewed as a big import-export business 
of ideas. Conventional, well-understood 
ideas in one area, when brought into an- 
other scientific domain, are suddenly seen 
in a new light. 

Newton’s observation that “standing
on the shoulders of giants” is key to scientific
discovery may have been only half
right. As documented in The Most Unknown,
scientists also have much to learn
from their contemporaries.

REFERENCE 
1. B. Uzzi, S. Mukherjee, M. Stringer, B. Jones, Science 342,


Brazil naturalizes
non-native species 


Brazil’s national policies are putting the
country’s megadiversity at risk (1–3). After
passing a 2016 law that will put constraints
on biodiversity research (3), the Ministry
of Agriculture, Livestock, and Supply has
taken another controversial action: A new
ordinance proposes that introduced aquatic
species in Brazil should be considered
“native” (4), including invasive species in
the Neotropics (1, 5). Classifying introduced
aquatic species as if they are indigenous
to Brazil could potentially cause even
more introductions and lead to the loss of
ecosystem services and functions, as well as
traditional knowledge about native species
(6). Moreover, Brazil shares some large
river basins (such as Paraguai, Parana, and
Amazon) with other countries. Therefore,
Brazil will become a major source of non-native
species for other countries in South
America. The rate of introductions in Brazil
will likely outpace the research investigating
their negative effects (1, 3).

This is not the first time that political 
decisions have tried to categorize non-native 
species as native. In 2009, the Brazilian 
Congress proposed a law that intended to 
“naturalize by decree” several non-native 
fishes to foster aquaculture development 
(5). The most recent ordinance is based on 
a 2016 law that considers introduced spe- 
cies with established populations as part of 
the Brazilian genetic heritage. Non-native 
species such as the Malaysian giant prawn 
(Macrobrachium rosenbergii), African catfish
(Clarias gariepinus), and American bullfrog
(Lithobates catesbeianus) have established 
populations in some localities (7), and the 
approval of this new ordinance will permit 
their free trade and rearing across Brazil. 
This retrogression conflicts with several 
Aichi Biodiversity Targets, especially the 
one related to the prevention, control, or 
eradication of non-native species (8). Brazil 
harbors the most diverse aquatic biota 
in the world (9), and it is imperative that 
local authorities take appropriate measures 
that value and preserve native biodiversity. 
Basic research and knowledge produced 
by scientists (10) should play a vital role in 
these decisions.

Brazil may remove restrictions on the trade of introduced species such as this American bullfrog.


Free satellite data
key to conservation


Biodiversity is in crisis, with extinction 
rates orders of magnitude higher than 
background levels (1). Underfunded conservationists
need to target their limited
resources effectively. Over the past decade, 
satellite remote sensing has revolutionized 
our ability to monitor biodiversity globally, 
and is now used routinely, especially by 
nongovernmental organizations, to detect 
changes, set priorities, and target conser- 
vation action. The U.S. Geological Survey 
(USGS) unlocked high-resolution Landsat 
data in 2008 (2), making data available 
online (3), and the Copernicus program 
from the European Commission subse- 
quently made their data available as well 
(4). These resources have been instrumen- 
tal to biodiversity research. Assessments of 
environmental changes such as deforesta- 
tion are now readily available. The current 
spatial and spectral resolution of Landsat
data make them appropriate to many conservation
applications, and although they
are not always ideal, pragmatic researchers 
with limited resources use them regularly. 
Conservationists have already called for 
these data to remain free (5). Consequently, 
the news that USGS may charge for data 
(6) is deeply troubling. 

USGS has recently convened an advisory 
committee to determine whether users 
would be prepared to pay for increased 
spectral and spatial resolution images (7). 
Requiring users to pay would put these 
images beyond the reach of conservation- 
ists. It would halt time-series analyses 
that have been useful in monitoring the 
effects of climate change, land-cover 
change, and ocean surfaces, likely hinder- 
ing the achievement of the Sustainable 
Development Goals (8). We urge the USGS 
to reconsider their position and continue 
to provide data from the Landsat program 
freely to all users.


Funding agencies can
prevent harassment


Harassment and lack of physical safety in
fieldwork and laboratories exist across a
range of disciplines (1, 2). Editorials and
#MeToo stories have recently highlighted 
that research is often conducted under 
“macho” conditions in which harassment, 
bullying, and unsafe work environments 
are common (3, 4). In response, codes of 
conduct for researcher safety are on the 
rise (3, 5). However, national research 
funds, private funding organizations, and 
monitoring agencies rarely require that 
the recipients of their grants implement 
codes of conduct or safety standards (2). 
Opportunities for cultural change should 
rest not only with individual scientists, 
teams, and professional societies. Funding 
agencies should share the responsibility. 
The cost of ensuring researcher safety 
should be part of the overall budget, and 
predefined safety standards should prevent 
situations in which harassment could 
occur (2). For example, when companies or 
institutions need scientists to do con- 
tracted monitoring work, bidding prices 
often determine whom they select. Unless 
funding agencies require safety standards, 
such bidding prices will always favor 
low-cost solutions that neglect safety. As 
another example, when principal investiga- 
tors (PIs) write applications, they should 
budget for training and counseling to 
prevent and address harassment. Such 
measures would be more widespread if 
funding agencies acknowledged them. 
Funding agencies have the power to 
participate in changing the culture by 
requiring codes of conduct for accept- 
able behavior from their grant recipients. 
Forcing researchers and companies to 
incorporate safety standards into grant 
proposals and assignment bids will increase 
awareness about harassment and stressful 
working environments. Only through full 
support from the broad spectrum of players 
involved in science will it be possible to cre- 
ate an inclusive and responsible culture that 
ensures safe workspaces. 
Comment on “Designing river flows to
improve food security futures in the Lower 
Mekong Basin” 


John G. Williams 

Sabo et al. (Research Articles, 8 December
2017, p. 1270) use sophisticated analyses of
flow and fishery data from the Lower Mekong
Basin to design a “good” hydrograph that,
if implemented by planned hydropower
dams, would increase the catch by a factor of
3.7. However, the hydrograph is not implementable,
and, if it were, it would devastate the
fishery. Further, the analyses are questionable.
Full text: dx.doi.org/10.1126/science.aat1225


Comment on “Designing river flows to 
improve food security futures in the Lower 
Mekong Basin” 


Ashley S. Halls and Peter B. Moyle 

The designer flow regime proposed by Sabo 
et al. (Research Articles, 8 December 2017, 
p. 1270) to support fisheries in the Lower 
Mekong Basin fails to account for important 
ecological, political, and economic dimen- 
sions. In doing so, they indicate that dam 
impacts can be easily mitigated. Such an 
action would serve to increase risks to food 
and livelihood futures in the basin. 

Full text: dx.doi.org/10.1126/science.aat1989


Response to Comments on “Designing river 
flows to improve food security futures in the 
Lower Mekong Basin” 

G. W. Holtgrieve, M. E. Arias, A. Ruhi,
V. Elliott, So Nam, Peng Bun Ngor, T. A.
Räsänen, J. L. Sabo

Sabo et al. presented an empirically derived
algorithm defining the socioecological
response of the Tonle Sap Dai fishery in the
Cambodian Mekong to basin-scale variation
in hydrologic flow regime. Williams suggests
that the analysis leading to the algorithm is 
flawed because of the large distance between 
the gauge used to measure water levels 
(hydrology) and the site of harvest for the 
fishery. Halls and Moyle argue that Sabo et al.'s 
findings are well known, and contend that the 
algorithm is not a comprehensive assess- 
ment of sustainability. We argue that Williams’ 
critique stems from a misunderstanding 
about our analysis; further clarification of the 
analysis is provided. We regret not citing more 
of the work indicated by Halls and Moyle, yet 
we note that our empirical analysis provides 
additional new insights into Mekong flow- 
fishery relationships. 

Full text: dx.doi.org/10.1126/science.aat1477
Comment on “Designing river flows to
improve food security futures in the 
Lower Mekong Basin” 


John G. Williams 


Sabo et al. (Research Articles, 8 December 2017, p. 1270) use sophisticated analyses of 
flow and fishery data from the Lower Mekong Basin to design a “good” hydrograph that, if 
implemented by planned hydropower dams, would increase the catch by a factor of 3.7. 
However, the hydrograph is not implementable, and, if it were, it would devastate the 
fishery. Further, the analyses are questionable. 


In a methodological tour de force, Sabo et al.
(1) relate components of the flow regime of
the Mekong River, quantified as stage at Stung
Treng, to catch in the important Dai fishery on
the Tonle Sap River, Cambodia. They use these
empirical relationships to design a “good” flow
regime proposed as an “ecological objective function”
for the operation of controversial dams
planned for the Mekong River. They project that
this flow regime would increase yield of the Dai
fishery by a factor of 3.7. However, simple flow
routing considerations show that the “good” regime
could not be implemented, and if it were, it
would devastate the Dai fishery.

Sabo et al. also depict the historical flow re- 
gime and a “bad” regime as plots of stage over 
time (Fig. 1). The plot of the “good” regime is a 
roughly rectangular wave, with long low-flow pe- 
riods connected to shorter high-flow periods by 
transitions ~15 days long. The transitions are more 
gradual in the “bad” regime. 

Stung Treng is several hundred kilometers 
upstream from the Tonle Sap-Mekong conflu- 
ence. The Tonle Sap connects the Mekong to a 
large natural lake and adjacent floodplains that 
support large populations of many fishes (2). 
Water flows up the Tonle Sap to the Great Lake 
during high flows in the Mekong, and back down 
the Tonle Sap to the Mekong during low flows. 
Flow in the Mekong is a major driver of stage in 
the Great Lake, but the Great Lake receives ~40% 
of its water from tributaries and rainfall (3), and 
the stage records differ substantially (Fig. 2). 

Sabo et al. state that “...we used estimated 
hydrologic drivers of the historical bag net, or 
‘Dai, fishery on the Tonle Sap River—the largest 
commercial fishery in the Mekong—to design bet- 
ter fisheries futures by comparing designed flows 
to current and pre-dam (natural-flow) regimes,” 
and they hypothesize that “Flow variation (high 
and low) may drive production by controlling
redox conditions in floodplain soils, ...” This im- 
plies that they intend their “good” flow regime 
for the Tonle Sap-Great Lake system, and perhaps
for other reaches of the lower Mekong with large 
floodplains. 

Implementing the “good” flow regime would en- 
tail managing releases from the most downstream 
dam, well upstream from the Tonle Sap-Mekong 
confluence. During the high-flow period, releases 
would be large and the floodplain would be in- 
undated. At the end of the high-flow period, 
releases from the dam would be cut to create the 
desired decrease in stage. Downstream from 
the dam, water would drain from the flood- 
plain to the channel as soon as releases were 
cut, reducing the rate of decrease in stage (and 
discharge) in the channel farther downstream. 
The opposite would happen, more strongly be- 
cause of downstream overland flow, at the end 
of the low-flow season, prolonging the transition 
from dry to wet conditions. The situation is too 
complex for simple analysis, but based on hydrodynamic
modeling, about 35 km³ of water flowed
onto the left bank of the Mekong upstream and 

Fig. 1. Illustration of designed flows, including a “good” design and a “bad” design, compared
to the reconstructed historical flow regime. [Copied from Sabo et al. (1)]

Fig. 2. Stage of the Mekong River at Stung Treng (solid line) and the Great Lake
at Kampong Luong (dashed line) for two flood cycles. The flood pulse in the Great
Lake lags that at Stung Treng, and high-frequency variation is suppressed.
Vertical axis: stage (m above datum). [Data from Sameng Preap, Mekong River
Commission]

on the opposite side from the Tonle Sap confluence 
in 2002 (4), so flows between the channel and 
floodplain are substantial. Thus, the proposed rect- 
angular wave would degrade to something more 
like the “bad” flow regime as it moved downstream. 

Regarding the Tonle Sap, the average annual 
outflow from the Great Lake exceeds 75 km³ (3).
Draining this volume in 15 days would require a
constant discharge of ~58,000 m³ s⁻¹, which is
almost 6 times the high flow reported in (2) and 
greater than current peak flows in the Mekong. 
This discharge could not plausibly be carried by 
the existing Tonle Sap channel. 
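
The discharge figure above follows from a unit conversion. A minimal sketch of that arithmetic in Python, assuming (as the text does) an outflow volume of 75 km³ drained at a constant rate over 15 days; the function name is illustrative only and does not come from the original analysis:

```python
def constant_discharge_m3_per_s(volume_km3: float, days: float) -> float:
    """Constant discharge (m^3/s) needed to drain volume_km3 in the given number of days."""
    cubic_metres = volume_km3 * 1e9   # 1 km^3 = 10^9 m^3
    seconds = days * 24 * 60 * 60     # 86,400 seconds per day
    return cubic_metres / seconds


discharge = constant_discharge_m3_per_s(75, 15)
print(round(discharge))  # 57870, i.e. roughly 58,000 m^3/s
```

Because the required discharge scales inversely with the drainage time, stretching the same volume over a longer falling limb lowers it in proportion.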

The Dai fishery uses what are essentially an- 
chored trawls, with mouths facing upstream, that 
depend on current to function. The nets target 
fishes migrating out of the Great Lake to the 
Mekong River with the falling limb of the hydro- 
graph, as the Great Lake drains over 5+ months 
(2). If somehow the “good” flow regime could be 
implemented, shortening the falling limb of the 
hydrograph to ~15 days would truncate the period
when the nets might be used by a factor of
about 11. Until the channel enlarged in response 
to the higher flows, the flow probably would be 
too swift to use the fine mesh nets needed to 
catch the small fishes that make up the bulk of 
the Dai fishery catch (2), even if it were possible 
to anchor the nets. 

Sabo et al. “hypothesized that high fisheries 
yields are driven by measurable attributes of hy- 
drologic variability, and that these relationships 
can be used to design and implement future flow 
regimes that improve fisheries yield through 
control of impending hydropower operations.” 
Indeed, the size of the flood pulse in the Great 
Lake is a good predictor of the Dai fishery catch 
(2). Sabo et al. found associations between several 
other attributes of variability in stage at Stung 
Treng and the catch, but whether any are “hydrologic
drivers” of catch remains an open question
because the flow-catch records are short
(17 years); the analysis was not specified in advance
(5); other important factors such as habitat
cover, sedimentation, and net primary produc- 
tion (6) were ignored; and it seems improbable 
that one drastic change in the flow regime would 
be good for the fishery whereas a modestly dif- 
ferent drastic change would be bad (7). Intuitively, 
variation in stage in the Great Lake seems more 
likely to influence catch in nets targeting fishes 
migrating out of the lake than does variation in 
stage in the Mekong River hundreds of kilometers 
upstream. 

Despite the sophistication of the methods used 
to develop empirical relationships, predictions 
based on them depend on the assumption of cau- 
sality and the assumption that all else will remain 
more or less equal, which clearly will not be true if 
the proposed dams are built and the hydrograph 
is radically altered. Complex computer-based 
analyses can be marvelously powerful tools, but 
they should be used carefully, and it is always 
necessary to think clearly about the results. 
Comment on “Designing river flows to
improve food security futures in the 
Lower Mekong Basin” 


Ashley S. Halls¹ and Peter B. Moyle²*


The designer flow regime proposed by Sabo et al. (Research Articles, 8 December 2017,
p. 1270) to support fisheries in the Lower Mekong Basin fails to account for important
ecological, political, and economic dimensions. In doing so, they indicate that dam
impacts can be easily mitigated. Such an action would serve to increase risks to food
and livelihood futures in the basin.


Sabo et al. (1) propose that dams in the Lower
Mekong Basin (LMB) could produce an
artificial (“designer”) hydrograph to mitigate
impacts on downstream fisheries that feed
millions of people. Modeling studies that
predict responses of tropical river fisheries to
hydrological variation are not new, even in the
LMB (2-4). A study similar to (1) was recently
undertaken by the Mekong River Commission
(MRC) (5) using a shorter segment of the same
CPUE (catch per unit of effort) time series for the
Cambodian Dai fishery. Hydrology was measured
in the Tonle Sap River where the fishery takes
place, not several hundred kilometers away at
Stung Treng as in (1).

This MRC study showed that a flood index, 
termed “FPExt” (flood pulse extent) (1), is the
primary driver of fisheries yields; fish growth 
increases exponentially with the flood index, 
likely reflecting improved feeding opportunities 
(6). The response was modeled “allowing pre- 
dictions to be made of how...fishery yield is like- 
ly to vary under different flooding conditions, 
whether natural, or modified [by] climate change 
and/or water management projects [dams]” (5). 


¹Aquae Sulis Research Ltd., Midway House, Turleigh,
Wiltshire BA15 2LR, UK. ²Department of Wildlife, Fish, and
Conservation Biology, Center for Watershed Sciences, 
University of California, Davis, CA 95616, USA. 
*Corresponding author. Email: pbmoyle@ucdavis.edu 


Sabo et al. also found this flood index to be a 
significant driver of CPUE, together with a com- 
posite index of dry-season anomalies. However, 
the authors appear to have disregarded the MRC 
findings (5): “...previous work suggests two hypo- 
theses to explain how dams diminish production 
of inland fisheries: (i) by reducing connectivity 
and dispersal, and (ii) by reducing primary pro- 
ductivity through the entrapment of sediment 
supply and associated nutrient delivery from 
headwaters to downstream nursery habitats” (1).

Sabo et al. explain that their positive relationship
between CPUE and FPExt “...may in part be
related to increased catchability of fish at high 
water levels.” However, catchability declines in 
large floods because fish density decreases. There- 
fore, fishermen in tropical rivers typically fish 
during the receding flood and dry seasons rather 
than peak flood (7). 

The model of Sabo et al. also does not take into
account other serious impacts of dams besides 
modifying downstream flows, such as creating 
barriers to fish migrations. Passage through dams 
increases mortality and diminishes reproductive 
success in fish populations. Dams trap nutrient- 
bearing sediments important for maintaining 
primary production and fish habitat, thereby 
diminishing water quality (8, 9). The barrier 
effects of dams alone could end important fish 
migrations in the LMB, irrespective of the avail- 
ability of designer flows (10). 


The “Good” design flow of (1) features large, 
almost rectangular, flood pulses, punctuated by 
long dry seasons, and very rapid transitions— 
approximately 15 days, versus 4 to 5 months under 
natural conditions. Power production is the pri- 
mary function of the Mekong dams, which are 
designed and financed according to predefined 
operating procedures. Existing (and planned) 
dams generate downstream flows that are the 
inverse of the “Good” design flow (i.e., diminished 
flood pulse and elevated dry-season flow). The 
“Good” flow would compromise the economic 
viability of dams in the LMB (11). We estimate 
that Cambodia alone would not have necessary 
storage (approximately 28 km³) to generate the
“Good” design flow regime (Table 1). Creating 
the “Good” design flow would require coordina- 
tion of dam operations by all six countries in the 
Mekong Basin, only three of which have a major 
stake in the Lower Mekong River fisheries (Table 1). 

Even if the flow could somehow be generated, 
the designer flow regime is an extrapolation of 
an empirical model of a pattern that has never 
been observed in the LMB. The rapid changes in 
flow between dry and wet seasons threaten fish 
populations upon which the Dai and other fisheries 
in the LMB depend (12) by removing migration
cues, hampering upstream spawning migrations, 
and sweeping downstream-drifting eggs and larvae 
past suitable nursery habitats. Stranding and 
diminished reproductive success under rapidly 
changing water levels will further diminish fish 
catch (13). Other flow-dependent sectors of the 
economy, such as flood recession agriculture (rice 
farming) and navigation, would also be affected 
by these new flow regimes. 
Table 1. Estimates of live storage in existing and planned dams in the LMB and estimated annual fish catch in each LMB country. Estimates are
from (14) and (15), respectively. The necessary live storage to generate the “Good” design flow proposed by Sabo et al. (1) is approximately 28 km³.
SUSTAINABILITY


Response to Comments on “Designing 
river flows to improve food security 
futures in the Lower Mekong Basin” 


G. W. Holtgrieve, M. E. Arias, A. Ruhi, V. Elliott, So Nam, Peng Bun Ngor,
T. A. Räsänen, J. L. Sabo


Sabo et al. presented an empirically derived algorithm defining the socioecological 
response of the Tonle Sap Dai fishery in the Cambodian Mekong to basin-scale variation in 
hydrologic flow regime. Williams suggests that the analysis leading to the algorithm is 
flawed because of the large distance between the gauge used to measure water levels 
(hydrology) and the site of harvest for the fishery. Halls and Moyle argue that Sabo et al.’s
findings are well known and contend that the algorithm is not a comprehensive assessment 
of sustainability. We argue that Williams’ critique stems from a misunderstanding about 
our analysis; further clarification of the analysis is provided. We regret not citing more of the 
work indicated by Halls and Moyle, yet we note that our empirical analysis provides 
additional new insights into Mekong flow-fishery relationships. 


Sabo et al. (1) presented an empirical analysis
that mathematically defines the socioecological
response of an ecosystem (fish
catch) to an increasingly human-controlled
physical ecosystem driver worldwide (hydrology).
This analysis yielded three critical findings:
(i) The timing of river discharge, as much as the 
magnitude, controls fish catches in the Tonle Sap 
River Dai fishery; (ii) the two aspects of the hy- 
drology most closely related to fish catch—water 
height during the flood and duration of the low- 
flow period—have been changing with existing 
dam development in the direction opposite to 
what promotes fish catches; and (iii) a “designed” 
hydrograph has the potential to maintain or even 
improve fish catches without the constraint of 
maintaining a completely natural hydrologic re- 
gime. Together, these results provide a new per- 
spective on managing flow regimes in heavily 
developed rivers and thus provide directions for 
future research, while simultaneously presenting 
a potentially viable approach for mitigating some 
of the impacts of dam development on capture
fisheries in the Mekong River system. 

In his Comment, Williams (2) contends that 
this empirical analysis is flawed because of the 
distance between the river gauging station used in 
the analysis and the location of the fishery, and 
that the “good design” flow scenario presented 
in Sabo et al. is thus unable to be implemented. 
Both points, which we believe arise from Williams’
misunderstanding of the analysis presented in
Sabo et al., are addressed below. In a second
Comment, Halls and Moyle (3) note that previ- 
ous research by Halls and others has identified 
flood magnitude and duration as important to 
Dai fishery catches, which were not cited by 
Sabo et al. They also question the use of the Stung 
Treng gauging station on the Mekong River for 
our analysis as opposed to the more proximate 
Prek Kdam gauging station on the Tonle Sap 
River. Lastly, Halls and Moyle raise concerns 
about implementation of a designed-flow regime 
and messaging to regional policy-makers that 
designed-flow regimes are a complete solution to 
environmental impacts arising from dam develop- 
ment in the Mekong. We appreciate the contri- 
butions of Williams (2) and Halls and Moyle (3) 
and the opportunity to discuss these issues. 

It is indeed unfortunate that Sabo et al. did not 
cite the full complement of research showing the 
relationship between fishery catches and flood- 
pulse magnitude and duration, as we agree that 
there is substantial literature in support of this 
result. Beyond this, however, Sabo et al. also discerned
that the magnitude and duration of the
low-flow period are as important to fisheries
as those of the high-flow period. This more novel
and unexpected result is nonetheless consistent 
with other research and general theories suggesting
that the production of organic matter and
nutrients in floodplains during the dry season 
supports the exceptionally high productivity of 
flood-pulse ecosystems observed worldwide (4-6). 
Governing mechanisms and the magnitude of 
nutrient and organic transfer from floodplains to 
fishes remain active areas of research, but current 
information suggests that their importance to 
fishery productivity is substantial [e.g., (7-9)]. 

The multivariate autoregressive modeling ap- 
proach used by Sabo et al. reveals a statistical 
relationship between multiple aspects of hydro- 
logic variance at the Stung Treng gauging station— 
namely, water height at maximum flood and 
duration of the low-flow period—and annual fish 
catches at the Dai fishery on the Tonle Sap River. 
Stung Treng was chosen because it had the longest 
period of record, which was important for quan- 
tifying pre-dam hydrologic conditions. The gaug- 
ing station in the Tonle Sap River at Prek Kdam 
is the closest to the fishery, but the water level 
record was too short to carry out the full analysis. 
The distance between the gauge station and the 
fishery is irrelevant, however, as long as the hy- 
drology at the reference location and at the fish- 
ery location are consistently related to each other. 
The two records need not be identical in shape or 
magnitude in order to be valid for analysis and 
forecasting, as systematic differences in hydrology 
between the locations are inherently captured in 
the empirical analysis. 

The data presented in figure 1 of Williams (2) 
are from the Tonle Sap Lake itself at Kampong 
Luong, yet despite the issues inherent in attempt- 
ing to compare height of water in a lake basin 
to height in a flowing river for only 2 years, the 
data show a high degree of systematic coherence. 
The more appropriate comparison between the 
Mekong stage at Stung Treng and the Tonle Sap 
River stage at Prek Kdam is given in Fig. 1 here. We 
have also extended the comparison to a 27-year 
period to better represent interannual variability. 
This figure shows two important aspects of the sys- 
tem hydrology: (i) the strong coherence between 
the two records (Fig. 1A), and (ii) the nonlinear 
relationship on the falling limb of the hydrograph, 
with stage decreasing faster in the Mekong than 
at Prek Kdam (Fig. 1B). The former validates the 
use of Stung Treng as the reference station in our 
analysis. The latter indicates that because of 
landscape morphometry, declining river stage at 
Stung Treng will precede and be more rapid than 
declines in stage in the Tonle Sap River. Because 
this effect is inherently captured in our analysis, 
it renders the calculation by Williams moot, as the 
“good design” presented in Sabo et al. is for the 
hydrograph at Stung Treng, not at the Tonle Sap 
River, and it is wrong to assume that the two 
stations should have identical hydrographs. In 
short, Williams has made an incorrect assump- 
tion that our “good design” hydrograph is to be 
applied at any location in the basin; it is applica- 
ble at Stung Treng only, with the response of fish 
catch applicable at the Dai fishery only. 
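
For readers who want to run this kind of coherence check on their own gauge records, a minimal sketch in Python follows. The stage values are made-up placeholders, not Mekong River Commission data, and the function name is hypothetical; only the statistic itself (Pearson r of log10-transformed stage) comes from the text. The sketch assumes strictly positive stage values and series that are not constant.

```python
import math


def pearson_r_log10(stage_a, stage_b):
    """Pearson correlation of two log10-transformed stage series of equal length."""
    if len(stage_a) != len(stage_b) or len(stage_a) < 2:
        raise ValueError("series must have equal length of at least 2")
    xs = [math.log10(v) for v in stage_a]  # requires positive stage values
    ys = [math.log10(v) for v in stage_b]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


# Hypothetical stages (m above datum) at an upstream and a downstream gauge.
upstream = [2.0, 3.5, 6.0, 9.5, 11.0, 8.0, 4.5, 2.5]
downstream = [1.5, 2.0, 4.0, 7.0, 9.0, 8.5, 6.0, 3.0]
r = pearson_r_log10(upstream, downstream)
print(f"r = {r:.2f}")
```

A high r indicates only that the two records move together; it does not by itself describe the lag or the nonlinearity on the falling limb discussed above.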

Any meaningful assessment of the implemen- 
tation of designed-flow scenarios in the Mekong 
will involve a complex modeling exercise that 
Fig. 1. Hydrograph coherence and nonlinearity in the Lower Mekong Basin. (A) River stage
at Stung Treng on the Mekong River and Prek Kdam on the Tonle Sap River from 2 January 1986
to 31 December 2008. The correlation coefficient (Pearson r) of log₁₀-transformed stage is
0.89. (B) The same data plotted against each other, showing a linear relationship in water height
between the two stations on the rising limb of the hydrograph but a nonlinear relationship on
the falling limb.


considers, at least, movement of water across the 
landscape and through time, the number and 
placement of dams throughout the watershed, 
operation rules for electricity production, coor- 
dination (or lack thereof) of dam operations, and 
future climate. Similarly, as Halls and Moyle (3) 
note, multiple other societal objectives beyond
fisheries (such as rice production, flood mitigation,
biodiversity protection, and electricity
production) must also be taken into account
when considering how river hydrology might 
be managed in the future. This was obviously 
beyond the scope of Sabo et al., and continued 
future research is needed to provide a more com- 
prehensive multisectoral analysis. The goal of 
such an analysis should be to present a broader 
suite of futures to local policy-makers to better
highlight critical cross-sector interdependencies. 

Finally, as rapid economic development pro- 
ceeds in the Lower Mekong Basin, an altered flow 
regime is one of many ecosystem changes likely 
to have an impact on fish production and fish- 
eries. Sabo et al.’s focus on flow regime does not 
imply that other potential impacts—for example, 
barriers to migration, reduced sediment supply, 
and eutrophication—are unimportant or need 
not be considered. The potential for unintended 
ecosystem consequences of a non-natural flow 
regime must also be further researched and 
fully considered in the decision-making process. 
Designing flow regimes is one area where there 
is scope for management action, however, and 
Sabo et al. provide a method to robustly quantify 
the relationship between flow regime and fish- 
eries catches in a way that managers can use. 
With this, there is now means for formal trade- 
off evaluation useful to policy-makers and a poten- 
tial path toward mitigating some of the impacts 
that dam development will have on lower Mekong 
fisheries. Exactly how designed flow regimes 
might be implemented in the real world will 
require substantial additional research and 
stakeholder engagement. 
Modeling tissues
with synthetic biology 


Toda et al., p. 156 


IN SCIENCE JOURNALS


PLANT SCIENCE 


How rice defeats the 
floodwaters 


Deepwater rice varieties grow
taller when flooded, in a growth 
response driven by the plant 
hormones gibberellin and ethyl- 
ene. This keeps the leaves above 




PHYSICS 
Simulating correlated 
electron systems 


Correlated electron systems are 
generally difficult to simulate 
because of limited capabilities 
of computational resources. 
Harris et al. used a D-Wave 

chip based on a large array 

of superconducting elements 

to simulate the phases of a 
complex magnetic system. They 
tuned the amount of frustration 
within the lattice and varied the 
effective transverse magnetic 
field, which revealed phase tran- 
sitions between a paramagnetic, 
an ordered antiferromagnetic, 
and a spin-glass phase. The 
results compare well to theory 
for this spin-glass problem, 
validating the approach for 




A farmer tends flooded deepwater rice fields inB: 


simulating problems in materi- 
als physics. —ISO 


Science, this issue p. 162 


GEODETIC MONITORING 
Nuclear testing under 
the radar 


North Korea conducted its sixth 
underground nuclear weapons 
test in September 2017. The 
seismic waves generated from 
the test allow for triangulation 
and explosive yield estimates. 
However, Wang et al. show that 
synthetic aperture radar (SAR) 
should be added to the arsenal 
of techniques used to detect 
and characterize nuclear tests. 
SAR tracks deformation from 
space, which resulted in a better 
constraint of source parameters 


by using deformation from the 
nuclear test and the subsequent 
collapse of Mount Mantap. The 
test occurred at a depth of 
about half a kilometer, with an 
explosive yield around 10 times 
that of the Hiroshima explosion. 
—BG 


Science, this issue p. 166 


GREENHOUSE GASES 
A leaky endeavor


Considerable amounts of the 
greenhouse gas methane leak 
from the U.S. oil and natural 

gas supply chain. Alvarez et al. 
reassessed the magnitude of 
this leakage and found that in 
2015, supply chain emissions 
were ~60% higher than the 

U.S. Environmental Protection 
Agency inventory estimate. They 
the water. Kuroha et al. identified the
genes underlying this phenotype, which 
encode a component of the gibberellin 
biosynthetic pathway and its regulatory 
ethylene-responsive transcription factor. 
This genetic relay drives growth of the 
plant stem internodes in response to 
flooding. Modern cultivated deepwater 
rice, which has been domesticated for 
adaptation to the monsoon season of 
Bangladesh, emerged from the genetic 
variation found in wild rice strains over a 
broader geographic region. —PJH 


Science, this issue p. 181


suggest that this discrepancy 
exists because current inven- 
tory methods miss emissions 
that occur during abnormal 
operating conditions. These data, 
and the methodology used to 
obtain them, could improve and 
verify international inventories of 
greenhouse gases and provide a 
better understanding of mitiga- 
tion efforts outlined by the Paris 
Agreement. —HJS 


Science, this issue p. 186 


EVOLUTIONARY COGNITION 
The impact of time 
wasted 


The amount of time already 
spent on a task influences 
human choice about whether 
to continue. This dedicated 
time, known as the “sunk cost,”
reduces the likelihood of giving 
up the pursuit of a reward, even 
when there is no indication 
of likely success. Sweis et al. 
show that this sensitivity to 
time invested occurs similarly 
in mice, rats, and humans (see 
the Perspective by Brosnan). 
All three display a resistance 
to giving up their pursuit of a 
reward in a foraging context, but 
only after they have made the 
decision to pursue the reward. 
—SNV 

Science, this issue p. 178; 

see also p. 124


EARLY DEVELOPMENT 
It takes two to tango 


Fusion of egg and sperm 
combines the genetic material 
of both parents in one cell. In 
mammals, including humans, 
each parental genome is initially 
confined in a separate pro- 
nucleus. For the new organism 
to develop, the two genomes 
must be spatially coordinated 
so that the first embryonic divi- 
sion can create two cells that 
combine both genomes in one 
nucleus. Reichmann et al. found 
that at the beginning of the 

first division, two microtubule 
spindles organize the maternal 
and paternal chromosomes and 
subsequently align to segre- 
gate the parental genomes in 
parallel (see the Perspective by 
Zielinska and Schuh). Failure 

of spindle alignment led to 
two-celled embryos with more 
than one nucleus per cell. Dual- 
spindle assembly in the zygote 
thus offers a potential mecha- 
nistic explanation for division 
errors frequently observed in 
human embryos in the fertility 
clinic. —SMH 

Science, this issue p. 189; 
see also p. 128 


Dual spindles separate maternal and 
paternal chromosomes in parallel. 


HEPATITIS C VIRUS 
Hampering HCV 
transmission 


No hepatitis C virus (HCV) 
vaccine is currently available. 
Furthermore, evidence from 
studies in nonhuman primates 
suggests that any future human 
HCV vaccine would be unlikely 
to induce complete immunity. 
Major et al. examined whether 
lowered HCV titers potentially 
resulting from an imperfect 
vaccine might still stem HCV 
transmission in people who 
inject drugs. They measured 
the HCV RNA from infected 
human plasma retained in con- 
taminated needles and syringes. 
Their mathematical model 
combining these measurements 
with published data on HCV 
viral kinetics suggested that a 
partially effective vaccine could 
reduce the HCV transmission 
risk from sharing contaminated 
needles and syringes. —CAC 

Sci. Transl. Med. 10, eaao4496 (2018). 


CELL BIOLOGY 
An inflammatory way to 
activate AMPK 


In its role as a sensor of energy 
status, the kinase AMPK is 
activated by phosphorylation 
mediated by the tumor sup- 
pressor LKB1. Antonia and 
Baldwin found that AMPK 
activation could also be medi- 
ated by a pathway involving 
TAK1, a kinase associated with 
inflammatory pathways, and its 
target, IKK. IKK phosphorylated 
AMPK independently of LKB1. 
Combining an IKK inhibitor with 
the cancer drug phenformin 
improved the drug's ability to 
kill LKB1-deficient cancer cells, 
highlighting a potential treat- 
ment for cancers lacking this 
tumor suppressor. —WW 

Sci. Signal. 11, eaan5850 (2018). 




IN OTHER JOURNALS 


EMERGING THERAPIES 
Gene editing gets a 
head start 


The development of gene- 
editing technologies into 
therapies for human disease 
is an exciting prospect. A 
crucial question is whether 
there are advantages to 
correcting disease-causing 
mutations before rather than 
after birth, and whether this 
approach is even feasible. In 
a proof-of-concept study in 
mice, Ricciardi et al. accom- 
plished successful in utero 
correction of a hemoglobin 
gene mutation that causes 
β-thalassemia, a serious 
blood disorder. They injected 
nanoparticles containing 
gene-editing machinery (tri- 
plex-forming peptide nucleic 
acids and single-stranded 
donor DNA) intravenously 
into midgestational mouse 
fetuses. After birth, the 
treated mice showed sus- 
tained amelioration of anemia 
and survived longer than 
untreated mice. —PAK 
Nat. Commun. 10.1038/s41467-018- 
04894-2 (2018). 


CANCER 


Finding the precise drug 
Treating cancer according 
to the molecular makeup of 
tumor cells is challenging. 
How can we find a target that 
will kill tumor cells and match 
it to an effective drug? Alvarez 
et al. analyzed the protein- 
signaling networks in human 
gastroenteropancreatic neuro- 
endocrine tumors (GEP-NETs) 
to find master regulators of 
tumor survival. Subsequent 
screening of GEP-NET-derived 
cells with various drugs 
confirmed that their top hit, 
entinostat, inhibited GEP-NET 
growth in vivo. This approach 
has the potential to provide 
unbiased precision therapy for 
many cancers, especially rare 
cancers, such as GEP-NETs. 
—GKA 

Nat. Genet. 10.1038/s41588-018- 
0138-4 (2018). 


QUANTUM OPTICS 
Single photons for 
optic fibers 


Being robust, fast, and able to 
encode information in a number 
of different ways, single photons 
are ideal carriers of quantum 
information. Exploiting the vast 
optic fiber network to route 
single photons between distant 
points, however, is hampered 
by the lack of single-photon 
sources operating at telecom 
wavelengths. Dibos et al. show 
that single erbium ions doped 
in a solid-state host material 
produce single photons at just 
the right wavelength for long- 
distance transmission. They use 
a silicon-based nanophotonic 
crystal in close proximity to a 
doped ion to enhance the extrac- 
tion of single photons and to 
couple them into an optic fiber. 
The results demonstrate the 
basis of an architecture to 
develop long-distance quan- 
tum optical networks. —ISO 
Phys. Rev. Lett. 120, 243601 (2018). 


Schematic illustration of silicon waveguides and a YSO crystal 


SCIENCE sciencemag.org 


MICROBIOTA 


Cell-to-cell action in cholera 


Gut bacteria are competitive. They jostle 
for niches and nutrients by means of a 
variety of mechanisms. Gram-negative 
bacteria devote a lot of resources 
to the comparatively large (up to 
600 nm) type 6 secretion system (T6SS) 
used to stab neighboring bacterial and 
host cells. Contrary to expectations, Fast 
et al. discovered that commensal bacteria 
are not always protective against T6SS- 
toting pathogens. Vibrio cholerae T6SS 
contributes to host death in the fly model 
of cholera. Acetobacter pasteurianus is a 
common, normally harmless, commensal 
in the fly gut. If A. pasteurianus is removed, 
surprisingly, V. cholerae is enfeebled, too. 
This effect seems to be mediated by signal- 
ing through the immune deficiency (IMD) 
pathway of the fly. The IMD pathway is “primed” by 
the commensal organism. —CA 
Proc. Natl. Acad. Sci. U.S.A. 10.1073/ 
pnas.1802165115 (2018). 


The pathogenicity of Vibrio cholerae (illustrated at left) 
is reduced by removal of a commensal bacterium. 


CELL BIOLOGY 


mTORC1 jams cell traffic 


The cytoplasm of a cell is 
stuffed with large molecules 
and organelles. Such crowding 
may cause some molecules 
to aggregate, which could in 
turn affect their biological 
function. To monitor cytoplas- 
mic crowding, Delarue et al. 
made a probe from bacte- 

rial proteins that form large 
scaffolds linked to fluorescent 
proteins. In yeast and human 
cells, the probes revealed that 
the protein kinase com- 

plex mTORC1 (mechanistic 
target of rapamycin complex 
1) enhanced cytoplasmic 
viscosity. It did so by increas- 
ing production of ribosomes 
while inhibiting their degra- 
dation. Because ribosomes 
occupy about 20% of a cell's 
volume, mTORC1 activity tunes 
cytoplasmic crowding and 
thus influences cell physiology. 
—LBR 
Cell 10.1016/j.cell.2018.05.042 
(2018). 


CARBON CAPTURE 


Help wanted 
Preventing global surface air 
temperatures from rising too 
high will require emissions 
to be reduced and CO₂ to be 
removed (captured) from 
the air. Many ways to do this 
have been proposed, although 
none to date are both tech- 
nologically and economically 
feasible, so new and better 
strategies are needed. Rau et 
al. suggest that generating H₂ 
with a combination of saline 
water electrolysis and mineral 
weathering, powered by elec- 
tricity not derived from fossil 
fuels, could greatly increase 
energy generation and CO₂ 
removal, at a lower cost than 
methods involving biomass 
energy plus carbon capture 
and storage. This approach 
also would allow carbon to 
be sequestered as long-lived 
ocean alkalinity rather than as 
concentrated CO₂. —HJS 

Nat. Clim. Change 8, 621 (2018). 


THERMOELECTRICS 
Lower symmetry for 
higher performance 


Thermoelectric materials inter- 
change heat and electricity. 
Lowering thermal conductivity 
while maintaining electrical 
conductivity is important for 
developing promising ther- 
moelectric materials. Li et al. 
depart from the usual strategy 
of using high-symmetry 
materials by moving from 
cubic to rhombohedral GeTe, 
which ends up boosting the 
thermoelectric performance. 
The performance metric called 
the figure of merit is 2.4 at 600 
K. The general strategy may be 
applicable to other materials, 
providing another pathway to 
improve performance. —BG 
Joule 2, 976 (2018). 


PLANT SCIENCE 
Genomic traces of 
symbiosis loss 


A symbiosis between cer- 
tain bacteria and their plant 
hosts delivers fixed nitrogen 
to the plants. Griesmann et 
al. sequenced several plant 
genomes to analyze why nitro- 
gen-fixing symbiosis is irregularly 
scattered through the evolution- 
ary tree (see the Perspective by 
Nagy). Various genomes carried 
traces of lost pathways that 
could have supported nitrogen- 
fixing symbiosis. It seems that 
this symbiosis, which relies on 
multiple pathways and complex 
interorganismal signaling, is sus- 
ceptible to selection and prone 
to being lost over evolutionary 
time. —PJH 

Science, this issue p. 144; 

see also p.125 


NEUTRINO ASTROPHYSICS 
Neutrino emission from a 
flaring blazar 


Neutrinos interact only very 
weakly with matter, but giant 
detectors have succeeded in 
detecting small numbers of 
astrophysical neutrinos. Aside 
from a diffuse background, 
only two individual sources 
have been identified: the Sun 
and a nearby supernova in 
1987. A multiteam collaboration 
detected a high-energy neutrino 
event whose arrival direction 
was consistent with a known 
blazar—a type of quasar with a 
relativistic jet oriented directly 
along our line of sight. The bla- 
zar, TXS 0506+056, was found 
to be undergoing a gamma-ray 
flare, prompting an extensive 
multiwavelength campaign. 
Motivated by this discovery, the 
IceCube collaboration examined 
lower-energy neutrinos detected 
over the previous several years, 
finding an excess emission at 
the location of the blazar. Thus, 
blazars are a source of astro- 
physical neutrinos. —KTS 
Science, this issue p. 147, p. 146 
SYNTHETIC BIOLOGY 
Engineering multilayered 
cellular structures 


The ability to program the manu- 
facture of biological structures 
may yield new biomaterials or 
synthetic tissues and organs. 
Toda et al. engineered mam- 
malian “sender” and “receiver” 
cells with synthetic cell surface 
ligands and receptors that 
controlled gene regulatory 
circuits based on Notch signal- 
ing. Programming the cells to 
express cell adhesion molecules 
and other regulatory molecules 
enabled spontaneous forma- 
tion of multilayered structures, 
like those that form during 
embryonic development. The 
three-layered structures even 
showed regeneration after injury. 
—LBR 


Science, this issue p.156 


FERROELECTRICITY 
Perovskites go organic 


The perovskite structure 
accommodates many differ- 
ent combinations of elements, 
making it attractive for use in 
a wide variety of applications. 
Building perovskites out of only 
organic compounds is appeal- 
ing because these materials 
tend to be flexible, fracture- 
resistant, and potentially 
easier to synthesize than their 
inorganic counterparts. Ye et al. 
describe a previously unknown 
family of all-organic perovskites, 
of which they synthesized 23 
different family members (see 
the Perspective by Li and Ji). 
The compounds are attractive 
as ferroelectrics, including one 
compound with properties close 
to the well-known inorganic fer- 
roelectric BaTiO₃. —BG 

Science, this issue p. 151; 

see also p. 132 


ORGANIC CHEMISTRY 
A silver cleaver splits 
cyclic amines 


Carbon-carbon single bonds 
are fairly unreactive when they 
are not strained in a tight ring. 
Roque et al. now report that a 
silver salt can cleave C—C bonds 
in unstrained cyclic amines such 
as pyrrolidines and piperidines. 
Paired with an electrophilic 
fluorine source in aqueous solu- 
tion, the silver first oxidizes the α 
carbon adjacent to the nitrogen. 
Ring-opening fluorination of the 
β carbon then proceeds by an 
apparent radical mechanism. 
The reaction offers a versatile 
means of introducing fluorine 
to structural motifs common in 
pharmaceutical research. —JSY 
Science, this issue p. 171 




EARLY OCEAN 
The rise of oxygen 


To understand the evolution of 
the biosphere, we need to know 
how much oxygen was present 
in Earth's atmosphere during 
most of the past 2.5 billion years. 
However, there are few proxies 
sensitive enough to quantify O₂ 
at the low levels present until 
slightly less than 1 billion years 
ago. Lu et al. measured iodine/ 
calcium ratios in marine car- 
bonates, which are a proxy for 
dissolved oxygen concentrations 
in the upper ocean. They found 
that a major, but temporary, rise 
in atmospheric O₂ occurred at 
around 400 million years ago 
and that O₂ levels underwent 

a step change to near-modern 
values around 200 million years 
ago. —HJS 


Science, this issue p. 174 


INFECTIOUS DISEASES 
Tracking parasite decline 


Neglected tropical diseases, 
such as malaria, are often 
caused by parasites, the 
population dynamics of which 
are difficult to track. Control 
programs that are aimed at 
eradicating these diseases would 
benefit from an ability to monitor 
parasite populations to assess 
efficacy and tailor responses. 

In a Perspective, Cotton et al. 
discuss the use of genomics to 
track parasite populations and 
their evolution in the context of 
control programs. They also dis- 
cuss how conservation genomics 
can inform us about the dynam- 
ics of extinction. —GKA 

Science, this issue p. 130 


Published by AAAS 


HUMAN GENETICS 
Methylation patterns in 
early embryos 


The methylation state of an 
embryo can be influenced 
by its environment, although 
whether this affects a person's 
health later in life is unclear. 
A mother's diet can influence 
genomic regions that vary 
greatly in methylation state, 
known as metastable epialleles 
(MEs). Kessler et al. performed 
genome-wide screening on 687 
candidate MEs in preimplanta- 
tion embryos of North American 
Caucasians. Compared with ME 
patterns previously observed 
in a Gambian population and in 
Chinese embryos, regions of the 
embryonic genome that are sen- 
sitive to season of conception 
harbored many MEs and atypical 
methylation patterns. Thus, MEs 
are sensitive to genetic and envi- 
ronmental features. —PJB 

Sci. Adv. 10.1126/sciadv.aat2624 

(2018). 


THYMUS 
Metabolic signaling 
circuits in thymocytes 


Cell differentiation is often 
accompanied by metabolic 
changes. Yang et al. report that 
generation of double-positive 
(DP) thymocytes, which have 
both CD4 and CD8 antigens 

on their surface, from double- 
negative (DN) thymocytes 
coincides with dynamic regula- 
tion of glycolytic and oxidative 
metabolism. Given the central 
role of mTORC1 (mechanistic 
target of rapamycin complex 1) 
signaling in regulating metabolic 
changes, the authors examined 
the role of the mTORC1 pathway 
in thymocyte development. 
They conditionally deleted 
RAPTOR, a key component of 
the mTORC1 complex. Loss of 
RAPTOR impaired the DN-to-DP 
transition, but unexpectedly also 
perturbed the balance between 
αβ and γδ T cells. —AB 


Sci. Immunol. 3, eaas9818 (2018). 


STRUCTURAL DYNAMICS 


Look fast 


Organisms from bacteria to 
humans sense and react to 
light. Proteins that contain the 
light-sensitive molecule retinal 
couple absorption of light to 
conformational changes that 
produce a signal or move ions 
across a membrane. Nogly et 
al. used an x-ray laser to probe 
the earliest structural changes 
to the retinal chromophore 
within microcrystals of the ion 
pump bacteriorhodopsin (see 
the Perspective by Moffat). The 
excited-state retinal wiggles but 
is held in place so that only one 
double bond of retinal is capable 
of isomerizing. A water molecule 
adjacent to the proton-pumping 
Schiff base responds to changes 
in charge distribution in the 
chromophore even before the 
movement of atoms begins. 
—MAF 

Science, this issue p. 145; 

see also p. 127 
Phylogenomics reveals multiple losses 
of nitrogen-fixing root nodule symbiosis 


Maximilian Griesmann, Yue Chang, Xin Liu, Yue Song, Georg Haberer, 

Matthew B. Crook, Benjamin Billault-Penneteau, Dominique Lauressergues, 

Jean Keller, Leandro Imanishi, Yuda Purwana Roswanjaya, Wouter Kohlen, 

Petar Pujic, Kai Battenberg, Nicole Alloisio, Yuhu Liang, Henk Hilhorst, 

Marco G. Salgado, Valerie Hocher, Hassen Gherbi, Sergio Svistoonoff, Jeff J. Doyle, 
Shixu He, Yan Xu, Shanyun Xu, Jing Qu, Qiang Gao, Xiaodong Fang, Yuan Fu, 
Philippe Normand, Alison M. Berry, Luis G. Wall, Jean-Michel Ané, 

Katharina Pawlowski, Xun Xu, Huanming Yang, Manuel Spannagl, Klaus F. X. Mayer, 
Gane Ka-Shu Wong, Martin Parniske*, Pierre-Marc Delaux*, Shifeng Cheng* 


INTRODUCTION: Access to nutrients such as 
nitrogen is required for plant growth. Legumes 
and nine additional plant families benefit from 
the nitrogen-fixing root nodule (NFN) symbio- 
sis, in which roots develop nodules that intra- 
cellularly host nitrogen-fixing bacteria. In this 
mutually beneficial symbiosis, the bacteria con- 
vert atmospheric nitrogen into ammonium and 
deliver it to the host plant. NFN symbiosis thus 
enables plant survival under nitrogen-limiting 
conditions in terrestrial ecosystems. In agricul- 
ture, this symbiosis reduces reliance on nitro- 
gen fertilizer, thus reducing the costs, ecological 
impact, and fossil fuel consumption attendant 
on large-scale application of fertilizers. 


RATIONALE: Molecular phylogenies show 
that NFN symbiosis is restricted to four angio- 
sperm orders—Fabales, Fagales, Cucurbitales, 
and Rosales—that together form the mono- 


phyletic NFN clade. However, only 10 of the 
28 plant families within this clade contain 
species engaged in the NFN symbiosis. Even 
within these 10 families, most genera do not 
form this symbiosis. The NFN symbiosis re- 
quires the coordinated function of more than 
30 essential genes. Presence of this symbiosis 
in related families suggests that a genetic change 
in the ancestor of the NFN clade enabled evo- 
lution of NFN symbiosis in this clade. The 
scattered distribution of functional NFN sym- 
biosis across the clade has led to the question 
of whether NFN symbiosis evolved multiple 
times independently in a convergent manner 
or was lost multiple times regardless of the 
number of times it arose. Fossil data have 
been unable to answer this question. Here 
we used molecular evidence to ask how the 
current pattern of plant species with NFN sym- 
biosis evolved. 


RESULTS: We sequenced the genomes of sev- 
en nodulating species belonging to the Fagales, 
Rosales, and Cucurbitales orders and the legume 
subfamily Caesalpinioideae. We complemented 
this dataset by sequencing three genomes of non- 
nodulating species from the Cucurbitales and 
from the legume subfamilies Cercidoideae and 
Papilionoideae. Using a genome-wide phyloge- 
nomic approach, we found that all legume genes 
with a characterized role in NFN symbiosis are con- 
served in nodulating species with one exception. We 
observed larger numbers of order-specific gene fam- 
ily expansions that, solely 
because of their phylogenetic distribution, may 
include genes contributing to multiple gains or 
subsequent refinements of the symbiosis. In 
parallel, we discovered signatures of multiple 
independent loss-of-function events for the gene 
encoding the indispensable NFN symbiosis 
regulator NODULE INCEPTION (NIN) in 10 of 
13 genomes of nonnodulating species within 
the NFN clade. The pattern suggests at least 
eight independent losses of NFN symbiosis. 

Read the full article at http://dx.doi.org/10.1126/science.aat1743 


CONCLUSION: We found that multiple inde- 
pendent losses of NFN symbiosis occurred in 
the four orders of the NFN clade. These results 
suggest that NFN symbiosis has previously been 
more common than currently evident and that 
this symbiosis is subject to an underestimated 
adverse selection pressure. 


The list of author affiliations is available in the full article online. 
*Corresponding author. Email: chengshf@genomics.cn (S.C.); 
pierre-marc.delaux@lrsv.ups-tlse.fr (P.-M.D.); parniske@lmu.de 
(M.P.) 

Cite this article as M. Griesmann et al., Science 361, eaat1743 
(2018). DOI: 10.1126/science.aat1743 
Phylogenomics and evolution of NFN symbiosis. Genome sequencing of nodulating and nonnodulating species combined with 27 previously available 
genomes resulted in a dataset spanning the NFN clade and species outside the NFN clade as an outgroup. Orthogroups were identified and filtered 
following three phylogenetic patterns. This genome-wide analysis identified two genes involved in NFN symbiosis, NIN and RHIZOBIUM-DIRECTED POLAR GROWTH 
(RPG), that were lost in most nonnodulating species. The occurrence of multiple losses (red crosses) of NFN symbiosis suggests an adverse selection pressure. 
PLANT SCIENCE 


Phylogenomics reveals multiple 
losses of nitrogen-fixing root 
nodule symbiosis 


Maximilian Griesmann, Yue Chang, Xin Liu, Yue Song, Georg Haberer, 
Matthew B. Crook, Benjamin Billault-Penneteau, Dominique Lauressergues, 
Jean Keller, Leandro Imanishi, Yuda Purwana Roswanjaya, Wouter Kohlen, 
Petar Pujic, Kai Battenberg, Nicole Alloisio, Yuhu Liang, Henk Hilhorst, 
Marco G. Salgado, Valerie Hocher, Hassen Gherbi, Sergio Svistoonoff, 
Jeff J. Doyle, Shixu He, Yan Xu, Shanyun Xu, Jing Qu, Qiang Gao, 
Xiaodong Fang, Yuan Fu, Philippe Normand, Alison M. Berry, Luis G. Wall, 
Jean-Michel Ané, Katharina Pawlowski, Xun Xu, Huanming Yang, 
Manuel Spannagl, Klaus F. X. Mayer, Gane Ka-Shu Wong, Martin Parniske*, 
Pierre-Marc Delaux*, Shifeng Cheng* 


The root nodule symbiosis of plants with nitrogen-fixing bacteria affects global nitrogen 
cycles and food production but is restricted to a subset of genera within a single clade of 
flowering plants. To explore the genetic basis for this scattered occurrence, we sequenced 
the genomes of 10 plant species covering the diversity of nodule morphotypes, bacterial 
symbionts, and infection strategies. In a genome-wide comparative analysis of a total of 
37 plant species, we discovered signatures of multiple independent loss-of-function events 


in the indispensable symbiotic regulator NODULE INCEPTION in 10 of 13 genomes of 
nonnodulating species within this clade. The discovery that multiple independent losses 
shaped the present-day distribution of nitrogen-fixing root nodule symbiosis in plants 
reveals a phylogenetically wider distribution in evolutionary history and a so-far-underestimated 


selection pressure against this symbiosis. 


Nitrogen is one of the main requirements 
for plant growth. Members of the legume 
family (Fabaceae, order Fabales) and of 
nine additional plant families benefit from 
symbiosis with nitrogen-fixing bacteria, 
either phylogenetically diverse rhizobia or species 
of the genus Frankia, which are hosted inside 
plant cells found within nodules—specialized 
host-derived lateral organs typically found on 
roots. In this mutualistic nitrogen-fixing root 
nodule (NFN) symbiosis, intracellular bacteria 
convert atmospheric nitrogen into ammonium 
by means of the enzyme nitrogenase (1). This 
“fixed nitrogen,” delivered to the host plant, is 
an essential building block for amino acids, DNA, 
RNA, tetrapyrroles such as chlorophyll, and many 
other molecules. This symbiosis enables plant 


survival under nitrogen-limiting conditions. In 
agriculture, this independence from chemical 
nitrogen fertilizer reduces costs and fossil fuel 
consumption imposed by the Haber-Bosch pro- 
cess (2). Since the discovery of the NFN symbi- 
osis, with rhizobia in 1888 and with Frankia in 
1895 (3, 4), it has been unclear why it is restricted 
to only a limited number of flowering plant species. 

A major scientific step forward in our under- 
standing was the reorganization of the phyloge- 
netic tree of angiosperms in 1995, which revealed 
that plants forming the NFN symbiosis are re- 
stricted to the Fabales, Fagales, Cucurbitales, 
and Rosales orders, which together form the NFN 
clade (5, 6). However, this reorganization also 
raised new questions, because only 10 of the 28 
plant families within the NFN clade contain plants 


that form nodules (referred to here as “nodulating 
species”) and these do not form a monophyletic 
group; moreover, within 9 of these 10 families, 
most genera do not form NFN symbiosis (7). In 
addition to this scattered distribution, a further 
unsolved mystery that surrounds the evolution 
of NFN symbiosis is its diversity at multiple levels: 
Legumes (Fabales) and the nonlegume Parasponia 
(Rosales) (8) form nodules with rhizobia, whereas 
species of the actinobacterial genus Frankia in- 
fect actinorhizal plants from eight plant families 
of the orders Fagales, Cucurbitales, and Rosales 
(9). A diversity of infection mechanisms has been 
described (9), and root nodule structures display 
wide variations (5, 7). The most parsimonious 
hypothesis to explain the restricted and yet 
scattered distribution pattern of such diverse 
NFN symbioses predicted a genetic change in 
the ancestor of the NFN clade, a predisposition 
event, that enabled the subsequent independent 
evolution of NFN symbiosis specifically and ex- 
clusively in this clade (the multiple-origins hypo- 
thesis), along with a number of losses (5, 7, 10-12). 
Recent quantitative phylogenetic modeling studies 
supported scenarios with independent gains and 
switches between the nonnodulating and nod- 
ulating states during the evolution of the NFN 
clade (11, 13). However, none of these analyses 
provide direct evidence or the molecular causes 
of the specific gains and losses that explain the 
distribution of NFN symbiosis in extant genera. 

Exploring the genetic basis underlying the 
evolutionary dynamics of the NFN symbiosis in 
plants will improve our understanding of the di- 
versity of symbiotic associations observed in ex- 
tant taxa and the ecosystems they inhabit and 
potentially provide keys to engineer it in crops 
and to predict the stability of this trait over long 
evolutionary times. Here we used a genome-wide 
comparison including genomic and phylogenomic 
methods (14) to address the long-standing con- 
undrum of the evolution of NFN symbiosis and 
identify the underlying genetic players (15, 16). 


Results 
Genome sequencing in the NFN clade 


Sequenced genomes of nodulating species were 
only available for a few agriculturally relevant 
legume species belonging to a single subfamily 
(Papilionoideae), all derived from a single pre- 
dicted evolutionary origin of NFN symbiosis, with 
no representation either of taxa representing 
possible additional origins within legumes or of 
nonlegume nodulating species widely accepted 
as representing multiple additional origins (17-19). 


1Faculty of Biology, Genetics, LMU Munich, Großhaderner Strasse 2-4, 82152 Martinsried, Germany. 2Plant Genome and Systems Biology (PGSB), Helmholtz Center Munich-German Research 
Center for Environmental Health, 85764 Neuherberg, Germany. 3BGI-Shenzhen, Shenzhen 518083, China. 4China National GeneBank, BGI-Shenzhen, Shenzhen 518120, China. 5Department of 
Microbiology, Weber State University, Ogden, UT 84408-2506, USA. 6Laboratoire de Recherche en Sciences Végétales (LRSV), Université de Toulouse, CNRS, UPS, 24 chemin de Borde Rouge, 
Auzeville, BP42617, 31326 Castanet Tolosan, France. 7LBMIBS, Departamento de Ciencia y Tecnologia, Universidad Nacional de Quilmes, CONICET, R. Sáenz Peña 352, B1876BXD Bernal, 
Argentina. 8Laboratory for Molecular Biology, Wageningen University, Droevendaalsesteeg 1, 6708 PB Wageningen, Netherlands. 9Université Lyon 1, Université de Lyon, CNRS, Ecologie 
Microbienne, UMR 5557, Villeurbanne, 69622 Cedex, France. 10Department of Plant Sciences, University of California, Davis, Davis, CA 95616, USA. 11Laboratory for Plant Physiology, Wageningen 
University, Droevendaalsesteeg 1, 6708 PB Wageningen, Netherlands. 12Department of Ecology, Environment and Plant Sciences, Stockholm University, 106 91 Stockholm, Sweden. 13French 
Conversely, sequenced genomes of nonnodulating 
species were restricted to the Fagales, Rosales, and 
Cucurbitales and did not include nonnodulating 
legume taxa (table S1). To overcome this sampling 
bias, which restricted the phylogenomic analysis, 
we sequenced de novo the genomes of seven 
nodulating species belonging to the Cucurbitales, 
Fagales, and Rosales orders and the Caesalpi- 
nioideae subfamily of the Fabaceae, representing 
a possible second origin of NFN symbiosis in 
legumes. Three nonnodulating species from the 
NFN clade were also sequenced, notably Nissolia 
schottii, a papilionoid legume that has lost the 
ability to form the NFN symbiosis and which 
therefore provides insights on the genomic con- 
sequences of losing the symbiosis (Fig. 1 and 
fig. S1). For each species, 144 to 381 Gb of Illumina 
reads were obtained, covering the estimated 
genomes at least 189-fold and up to 1113-fold, 
with the resulting scaffold N50 length between 
96 kb and 1.18 Mb and an average genome 
completeness of 96% (Fig. 1, figs. S1 and S2, and 
tables S2 to S29). Altogether, the genomes of 
species sequenced here represent six families 
and most known nodule anatomy types and root 
infection pathways; include hosts for the main 
classes of nodule-inducing symbiotic nitrogen- 
fixing α-proteobacteria, β-proteobacteria, and 
actinobacteria; and cover six to seven independent 
evolutionary origins of NFN symbiosis, according 
to the multiple-gains hypothesis (5, 7, 10, 11). 
These 10 sequenced genomes, together with 18 
other genomes from the NFN clade and 9 gen- 
omes from other flowering plants as an out- 
group (Fig. 1), were compared to detect molecular 
traces supporting any of the three postulated 
events in the evolution of NFN symbiosis: (i) 
predisposition to evolve it, (ii) multiple inde- 
pendent gains, and (iii) multiple independent 
losses of NFN symbiosis. 


Fig. 1. Genome features of 
species used in this study. 
Genome statistics are shown as 
pictograms for species used 
in this study, with nodulating 
species highlighted by blue 
sectors. Species names are 
shown as four-letter abbreviations 
at the outer circle, with their 
taxonomic order color coded as 
shown in the top-right legend. 
Newly sequenced species are in 
bold blue letters. The next two 
circles show, as pie charts, the 
proportion of complete BUSCO 
genes [see (68)] detected in the 
genome assembly (light gray) 
and the percentage of assembled 
sequence relative to the 
estimated genome size (dark 
gray), respectively. Scaffold 
N50 values are depicted as 
bubble charts (black) capped to a 
maximal N50 of 1 Mb to reduce 
graphical biases by finished 
genomes assembled to 
pseudochromosomes. Note 
that even for assemblies in this 
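
For readers unfamiliar with the assembly statistics quoted in this section, the following minimal Python sketch shows how a scaffold N50 and a fold coverage are conventionally computed. The function names, the toy scaffold lengths, and the 500-Mb genome size are assumptions made for illustration; they are not values or code from the study.

```python
def scaffold_n50(lengths):
    """Return the N50 of an assembly.

    The N50 is the scaffold length L such that scaffolds of length >= L
    together contain at least half of all assembled bases.
    """
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length
    raise ValueError("no scaffolds given")


def fold_coverage(read_bases, genome_size):
    """Sequenced bases divided by the estimated genome size."""
    return read_bases / genome_size


# Toy assembly (hypothetical lengths, in bases).
toy_scaffolds = [100, 80, 60, 40, 20]
toy_n50 = scaffold_n50(toy_scaffolds)

# 144 Gb of reads over a hypothetical 500-Mb genome.
toy_coverage = fold_coverage(144e9, 500e6)
```

Under these assumptions the toy assembly has an N50 of 80, because the two longest scaffolds (100 + 80) already hold more than half of the 300 assembled bases, and 144 Gb of reads over a 500-Mb genome corresponds to 288-fold coverage.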


The putative predisposition event did not involve 
NFN clade-specific gene gains 


The predisposition event postulated by Soltis et al. 
in 1995 (5) may be based on the acquisition of 
one or several genes or sequence modifications 
specific to the NFN clade. This acquisition would 
be also consistent with a single-origin hypothesis, 
in which NFN symbiosis in all taxa is predicted 
as a homologous trait. Genes acquired during the 
predisposition event are expected to be specific 
to the NFN clade and present in all nodulating 
species. To search for genes following this evol- 
utionary pattern, we identified gene families 
across all 37 plant genomes in our dataset using 
the Orthofinder pipeline (20). For each of the 
resulting 29,433 gene family clusters, we cal- 


culated a separate phylogeny and subsequently 
inferred orthologs for all genes of the reference 
species, Medicago truncatula (16). We selected 
groups for which orthologs were absent in the 
nine species outside of the NFN clade but were 
retained in nodulating species (fig. S3). To ob- 
tain a candidate set for manual validation from 
the total of 29,213 orthologous groups, we used 
an automated filtering approach with relaxed 
criteria, which allowed for the absence of ortho- 
logs in a small subset of nodulating species 
(16). This step was necessary to avoid the loss of 
putative candidates owing to missed gene models 
resulting in false negatives, which often occur in
automated gene-prediction pipelines. Our relaxed
filter identified a total of 31 orthologous
candidate groups (table S30). All of these candidate
groups underwent an iterative manual
curation, including a search for missed gene 
models and recalculation of phylogenies and 
orthologs (16). 
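
The relaxed presence-absence filter described above can be sketched as follows. This is a minimal illustration rather than the authors' pipeline: the function name, the species codes, and the `max_missing_nodulators` tolerance are hypothetical, because the text does not state how many nodulating species were allowed to lack an ortholog.

```python
from typing import Dict, List, Set


def predisposition_candidates(
    groups: Dict[str, Set[str]],
    outgroup: Set[str],
    nodulating: Set[str],
    max_missing_nodulators: int = 1,  # hypothetical tolerance, not from the source
) -> List[str]:
    """Return orthologous groups that are absent from every outgroup species
    and present in all but at most `max_missing_nodulators` nodulating species.

    `groups` maps a group ID to the set of species that carry an ortholog.
    """
    selected = []
    for group_id, species_with_ortholog in groups.items():
        if species_with_ortholog & outgroup:
            continue  # an ortholog exists outside the clade
        missing = nodulating - species_with_ortholog
        if len(missing) <= max_missing_nodulators:
            selected.append(group_id)
    return sorted(selected)
```

Setting `max_missing_nodulators=0` corresponds to the stringent criterion applied during manual curation, in which presence in all nodulating species is required.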

Not a single candidate gene was identified 
that matched the evolutionary pattern expected 
for predisposition-related genes, suggesting that 
genes gained in the most recent common an- 
cestor of the NFN clade have been conserved or 
lost irrespective of the symbiotic state of the
lineages. If the predisposition indeed occurred,
this result indicates that it did not involve the
acquisition of genes but rather the co-option
of existing genes and their corresponding 
pathways. 


Gene family dynamics is compatible 
with multiple gains 


Multiple molecular mechanisms leading to the 
convergent evolution of a trait have been iden- 
tified (21). Deep homology (22), the independent 
recruitment of a homologous gene set for the 
development of nonhomologous traits, has been 
proposed for the NFN symbiosis (7, 23). Indeed, 
several genes initially identified for their sym- 
biotic role in legumes were later found to also 
play a symbiotic role in Fagales (24-27), Rosales 
(28, 29), and Cucurbitales (30). An alternative 
mechanism for the evolution of new traits is gene 
family expansion, as exemplified by the parallel 
diversification of the zinc-finger transcription fac- 
tor family in the evolution of a dominant yeast 
form in fungi (31) or the acquisition of strigolactone
perception in parasitic plants (32). We hypothe- 
sized that if NFN symbiosis evolved multiple 
times, independent expansions in the same gene 
family might have been involved. Given that our 
dataset covers six to seven of the predicted in- 
dependent gains of NFN symbiosis, it allowed us 
to search for gene families whose evolutionary 
patterns are consistent and would support the 
multiple-gains hypothesis (7). We analyzed Ortho- 
finder clusters for copy-number variation to iden- 
tify gene family expansion events at each of these 
nodes (Fig. 2) (16). Multiple alternative models 
have been proposed for the independent gain 
of NFN symbiosis. In one scenario (7), a single
gain is proposed before the radiation of the legume
family; this node correlates with the expansion
of 33 clusters in our analysis (Fig. 2). To
test whether any of these clusters were en- 
riched with differentially expressed genes (DEGs) 
in nodule tissue versus root tissue, we obtained
M. truncatula (Medicago) gene expression data
for both conditions from the gene expression atlas
(33) and tested for an enrichment of Medicago
DEGs in each cluster (table S31). We found only
one such cluster that belongs to the nitrate 
transporter family NRT1/PTR (34). However, 
this gene family appears to also be expanded at 
nodes not related to independent gains within 
and outside of the NFN clade (table S31). An
alternative model proposes two independent 
gains within the legumes: the first at the most 
recent common ancestor of the papilionoids and 
the clade to which Castanospermum australe 
belongs (21-291 enriched-expanded clusters), or at 
the base of the papilionoids (4-32), and the second 
gain at the base of the caesalpinioid and mimosoid 
clade (7-92). This clade could alternatively com- 
prise two independent gains for Chamaecrista 
fasciculata (39-710) and mimosoids (37-723). 
Larger numbers of expanded gene families were 
observed for the predicted events in the Rosales, 
Fagales, and Cucurbitales (Fig. 2 and table S31). 
Taken together, we found 52 gene family clusters 
that were enriched with differentially expressed 
genes in Medicago nodules and expanded mul- 
tiple times at proposed independent gain nodes. 
However, similar to the NRT1/PTR family, all of
the enriched clusters also expanded outside the
NFN clade. Inside the NFN clade, these clusters
expanded not only at the nodes hypothesized as gains
of NFN symbiosis but also at many additional nodes (table
S31). In our survey of all independent gene family
expansions, we did not identify any cluster that 
displayed parallel expansions, one possibility 
among others that would indicate convergent 
recruitment for the independent gains of NFN 
symbiosis (7, 16). If NFN symbiosis indeed evolved 
multiple times, our genome-wide analysis re- 
vealed hundreds of clade-specific candidate genes 
that, together with gene co-option, may have 
played a role in the putative independent evolu- 
tions of this trait. 
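
One common way to test whether a cluster is enriched with DEGs is a one-sided hypergeometric test. The sketch below assumes that test; the text does not name the statistic that was actually used, and the function and parameter names are illustrative.

```python
from math import comb


def enrichment_p_value(degs_in_cluster: int, cluster_size: int,
                       total_degs: int, total_genes: int) -> float:
    """P(X >= degs_in_cluster) when `cluster_size` genes are drawn without
    replacement from `total_genes` genes, `total_degs` of which are DEGs."""
    upper = min(cluster_size, total_degs)
    favourable = sum(
        # math.comb returns 0 when the second argument exceeds the first,
        # so impossible combinations contribute nothing to the sum.
        comb(total_degs, i) * comb(total_genes - total_degs, cluster_size - i)
        for i in range(degs_in_cluster, upper + 1)
    )
    return favourable / comb(total_genes, cluster_size)
```

When many clusters are tested, the resulting P values would normally be corrected for multiple testing before a cluster is called enriched.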


Genomic evidence for multiple losses 


Testing homologies of a trait shared by multiple 
taxa typically involves assessing the trait in these 
species and inferring origins of the trait once 
homology is accepted or rejected. However, in- 
formation on the origin of a trait, and thus its 
homology, can also be obtained from taxa lacking 
the trait, by distinguishing primary absence (the 
taxon never had the trait) from secondary loss 
of the trait (23). It has been demonstrated that 
genes involved in a specific biological process 
are lost following the loss of that trait, a process 
known as gene coelimination (35-37). To test 
the multiple-losses hypothesis, we searched the 
ortholog groups calculated above for an evolu- 
tionary pattern that retained orthologs in all 
nodulating species but lost them in nonnodulat- 
ing ones. To filter and confirm the list of can- 
didate orthologous groups, we used the same 
two-step process combining an automated pipe- 
line with relaxed criteria for nodulating species 
followed by a careful manual curation and evalua- 
tion step with stringent criteria (presence in all 
nodulating species required). The automated 
pipeline with relaxed criteria resulted in a list 
of 121 candidate groups (table S32). During our
manual confirmation and refinement, we rejected 
31 of these candidate groups because orthologs 
were absent from one or more nodulating species. 
Another 62 candidate groups were rejected because
orthologs were present in more than 50% of
nonnodulating species. A weak phylogenetic
signal did not allow for inferring a reliable 
orthology for 27 candidate groups. A single 
gene, NODULE INCEPTION (NIN), was con- 
firmed to be present in the genomes of all the 
nodulating species and in the genomes of species
outside the NFN clade, and absent from most
nonnodulating ones in the NFN clade (Fig. 2 and
fig. S4). Forward genetic screens in the legumes 
Lotus japonicus (27), Pisum sativum (38), and 
M. truncatula (24) identified NIN as indispens- 
able for the two developmental aspects of the 
NFN symbiosis: initiation of root nodule development
and the formation of the plant structure
facilitating intracellular uptake of bacteria (27). 
Furthermore, RNA interference-based suppres- 
sion of NIN expression in Casuarina glauca 
(Casuarinaceae, Fagales) impaired nodule formation
(25), consistent with a conservation of the
role of NIN in NFN symbiosis in actinorhizal
host plants. 
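
The stringent rejection criteria applied during manual curation can be sketched as a small triage function. This is an illustration under stated assumptions: the function name and return strings are hypothetical, `nonnodulating` is taken to mean the nonnodulating species of the NFN clade, and the rejection for weak phylogenetic signal is not modeled because it depends on tree inspection.

```python
from typing import Set


def triage_loss_candidate(present_in: Set[str],
                          nodulating: Set[str],
                          nonnodulating: Set[str]) -> str:
    """Classify one orthologous group by its presence-absence pattern.

    `present_in` is the set of species in which an ortholog was found.
    """
    # Criterion 1: the ortholog must be present in every nodulating species.
    if nodulating - present_in:
        return "rejected: ortholog absent from a nodulating species"
    # Criterion 2: the ortholog must be lost from at least half of the
    # nonnodulating species (retained in more than 50% means rejection).
    retained = len(present_in & nonnodulating)
    if retained > 0.5 * len(nonnodulating):
        return "rejected: ortholog retained in >50% of nonnodulating species"
    return "candidate"
```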


Confirmation of NIN absence 
by microsynteny 


In addition to the presence of NIN in all the 
nodulating species in our dataset, synteny anal- 
ysis revealed the conservation of the syntenic 
blocks surrounding the NIN locus across the 
NFN clade (Fig. 2). By contrast, 10 of 13 nonnodulating
species of the Fabales, Fagales, Cucurbitales,
and Rosales underwent partial (four
species) or complete (six species) deletions of 
NIN from the conserved genomic block (Fig. 2). 
The legume family is divided into six subfamilies, 
two of which include nodulating species (39). 
According to previous estimates (7), Cercis (a 
member of Cercidoideae, which is sister to most 
or all other legumes) and Castanospermum 
(a member of a clade sister to the crown group 
of papilionoid legumes, in which nearly all 
members form the NFN symbiosis) both rep- 
resent lineages in which NFN symbiosis never 
occurred. By contrast, Nissolia and its sister 
genus Chaetocalyx are nonnodulating genera 
nested within the nodulating crown group of 
papilionoids and thus have been predicted to 
represent secondary loss of NFN symbiosis 
(11, 40). Our synteny approach allowed the 
discovery of the complete absence of NIN from 
the genome of N. schottii (Fig. 2). In addition, 
we confirmed the absence of NIN in three 
Chaetocalyx species (fig. S5). Because NIN was 
present in the most recent common ancestor 
of the NFN clade and conserved in nodulating 
species, the absence of this gene in the Nissolia- 
Chaetocalyx lineage represents a loss that cor- 
relates with, and is sufficient to explain, the loss 
of NFN symbiosis. Cercis canadensis harbored 
only a NIN pseudogene remnant in the genomic 
block, whereas the genome of C. australe com- 
pletely lacked NIN (Fig. 2). These results demon- 
strate three independent losses of NIN in the 
legume family. Similarly, the synteny analyses 
confirmed a minimum of three independent 
losses in nonnodulating Rosales and two in the 
Cucurbitales (Fig. 2). Together, the diversity of 
NIN deletions in the nonnodulating species is 
indicative of at least eight independent evolutionary
events that led to the loss of NIN function
(Fig. 3). Following the multiple-gains hypothesis 
(7), NFN symbiosis was predicted to have evolved 
independently at least six times in the species 
space sampled here (Fig. 2). Loss of NIN pro- 
vides an alternative model with at least eight 
independent losses of NFN symbiosis (Fig. 3). 
Thus, the current distribution of NFN symbiosis 
might be a combination of these two comple- 
mentary and not mutually exclusive models. 


Loss of RPG 


In parallel with these multiple confirmed losses 
of NIN, the synteny analysis also confirmed the 
presence of NIN in three nonnodulators of the 
NFN clade (Fig. 2). We hypothesized that the loss
of genes other than NIN may explain the non- 
nodulating state of these species and that such 
genes may have been missed by the specific
and stringent criteria for the detection of the 
presence-absence patterns. In addition to NIN, 
we determined the presence-absence pattern 
of 21 genes that were identified by forward and
reverse genetics as critical for NFN symbiosis
in legumes (Fig. 3 and table S33). Among these
genes, 20 were conserved in nodulating species
and most nonnodulating ones (figs. S6 to S25).
By contrast, RHIZOBIUM-DIRECTED POLAR
GROWTH (RPG) (41), which is present outside
the NFN clade, is missing in N. schottii and 11
other nonnodulating species from the four
orders of the NFN clade (Fig. 3). These losses
were confirmed by microsynteny analyses for


Fig. 2. Gene family expansions and contractions in the NFN clade. In
the left panel, a species phylogeny of the used dataset is depicted. For
each node, the numbers of gene families showing expansions (+, blue)
and contractions (-, red) are given. Blue boxes point out nodes hypothesized
to be positions of independent gain events of the NFN symbiosis, including
all suggested alternative models in the given dataset (7). For example, [...]
independent gains. A black arrow marks the base of the NFN clade. The
right panel depicts syntenic relationships of the NIN region. NIN genes are
colored in green. NIN gene IDs are shown above the gene symbol. The
synteny analysis upholds orthologous relationships drawn from the
phylogenetic analysis and supports the absence of NIN in several species
by verifying the existence of contiguous NIN regions without NIN genes.
Enlarged gene models are only shown for fragmented NIN genes in
comparison to the full M. truncatula NIN gene. Blocks with no fill and green [...]
Medicago NIN. Insertions and deletions (INDEL) and premature stop
codons (*) are symbolized by a vertical red line.

all nonnodulating species (fig. S26). In the
M. truncatula rpg mutant, infection threads 
are still present but their structure is abnormal 
(41), indicating that RPG, similar to NIN, is re- 
quired for proper infection-thread progression. 
However, in contrast to nin mutants, nodules are 
formed on rpg mutant roots (41). Among the
nodulating species, RPG is absent in the papilionoid 
Arachis ipaensis (Fig. 3). In the genus Arachis, 
infection threads were not observed; instead, rhizo- 
bia appear to infect nodules intercellularly (42). 
Polymorphism in RPG may represent an inter- 
mediate step on an evolutionary path toward the 
loss of this symbiosis in Arachis, a genus in which 
NFN symbiosis is described as a labile trait (43, 44).
Absence of RPG in Arachis also explains why 
the genome-wide comparative phylogenomic 
approach did not identify this gene, given that 
the pipeline required the candidate genes to be 
present in all nodulating species. RPG was one 
of the genes rejected for not fulfilling this crit- 
erion. Among nonnodulating species, Juglans 
regia (Fagales), Ziziphus jujuba, and Prunus 
persica (Rosales) have lost RPG but retained NIN, 
suggesting that additional mutations might be 
causative for the loss of NFN symbiosis in these 
species. Some of the candidate mutation targets, 
for example, lysine motif (LysM) receptors in- 
volved in the perception of symbiotic signals 
produced by nitrogen-fixing nodulating rhizobia, 


Fig. 3. Phylogenetic pattern of NFN symbiosis-related [...] Legend: ortholog present; co-ortholog of NIN/NLP1 or RPG/RRP; ortholog fragmented.


will be difficult to identify by phylogenomic anal- 
ysis, because of the rapid evolution and expan- 
sionary dynamics of these gene families (45). In 
contrast to other symbiosis-relevant genes involved 
in infection, NIN and RPG are only known to have 
NFN symbiosis-specific functions, whereas the
mutation of other genes may have more pleiotropic 
effects. For example, the key signaling components 
SYMRK, CCaMK, and CYCLOPS are involved in 
both NFN symbiosis and arbuscular mycorrhizal 
symbiosis, the most widespread symbiosis in 
land plants (46). Mutations in any of these three 
genes would also affect arbuscular mycorrhizal 
symbiosis. Illustrating this dual selection pressure, 
retention of these genes has been described in the 
genus Lupinus, which lost the arbuscular my- 
corrhizal symbiosis but retained NFN symbiosis 
(37, 47, 48). Given that both NIN and RPG are 
present in species outside the NFN clade (Fig. 3 
and figs. S4 and S6), their consistent losses in 
nonnodulating species suggest a shift in constraints
on sequence evolution specifically in the
NFN clade, which might be mirrored by
signatures of relaxed or positive selection on both
genes at the base of the NFN clade. We
investigated the selective pressure acting on the
NFN clade for these two genes using the PAML
package (49). Results did not reveal a statisti- 
cally significant positive or relaxed selection 
occurring in the NFN clade that would have 


Fig. 3 legend (continued): independent gain of root nodule symbiosis; independent loss of NIN.


reflected a putative neofunctionalization for NFN
symbiosis-specific functions (table S34).


Discussion 


In recent decades, the favored model to explain 
the scattered occurrence of NFN symbiosis in 
flowering plants predicted a single predisposi- 
tion event at the base of the NFN clade followed 
by up to 16 origins, even though the occurrence 
of multiple losses was never excluded (7, 11). 

Our genome-wide comparative analysis did 
not detect gene gains specific to the NFN clade 
and maintained in all nodulating species. Such 
genes would have been ideal candidates for either 
the predisposition event (in the multiple-gains 
hypothesis) or the evolution of NFN symbiosis 
itself in the hypothesis that NFN symbiosis evolved 
only once in the most recent common ancestor 
of the NFN clade (the single-gain hypothesis).
This indicates that this step involved either fast- 
evolving genes that were not captured by our 
phylogenomics pipeline or subtler genetic changes. 
Evolutionary developmental genetics in plant, 
fungal, and animal systems have revealed that 
even more than gains of genes, new traits often 
arise from the rewiring of existing gene networks 
via gains or losses of cis regulatory elements lead- 
ing to the co-option of ancestral genes (50). A sim- 
ilar mechanism may have acted in the most recent 
common ancestor of the NFN clade.
Co-option of different genetic components, or of
homologous ones in the case of deep homology,
may lead to the convergent evolution of
nonhomologous traits in multiple species (50).
Deep homology has been invoked for the evolu- 
tion of NFN symbiosis, during either the pre- 
disposition or the following putative multiple 
gains (7). Our results support this hypothesis, 
given that all the genes characterized for their 
involvement in NFN symbiosis in legumes were 
already present in the most recent common 
ancestor of the NFN clade and that we did not 
detect genes specific to the NFN clade that were 
conserved in all nodulating species (Figs. 2 and 3). 
For the putative multiple gains of NFN symbiosis, 
it cannot be excluded that gene gains were also 
involved in addition to co-option of ancestral 
pathways. We identified hundreds of such 
lineage-specific candidate genes (Fig. 2). How- 
ever, considering that most of the predicted gains 
in our analysis are located in terminal taxa that 
are known to accumulate orphan genes and 
species-specific duplications in comparative ap- 
proaches, it can be anticipated that only a subset 
of them participated in the evolution of NFN 
symbiosis (in the multiple-gains hypothesis) or 
in lineage-specific refinements of the trait (in 
the single-gain hypothesis). 

Our results validate another hypothesis: mul- 
tiple independent losses of NFN symbiosis in the 
four orders of the NFN clade. In a classical model 
of evolution, if the number of losses necessary to 
explain the distribution of a trait in a given clade 
outnumbers the predicted gains, multiple gains 
will be favored over multiple losses to explain
the distribution of the trait. In legumes, up to 
six gains of NFN symbiosis were predicted (7), 
whereas the clear losses that we identified in 
C. canadensis, C. australe, and N. schottii now
argue for a single origin before the radiation of 
the family (Fig. 3). Beyond legumes, the number 
of validated losses of NFN symbiosis is consistent 
with a single gain of this symbiosis in the most 
recent common ancestor of the NFN clade, even 
though it does not reject the possible occurrence 
of multiple gains. The recent identification of loss 
of NFN symbiosis in the Rosales Trema orientalis 
brings further support to this hypothesis (51).
Besides NFN symbiosis, the single or multiple 
origin(s) of other traits, such as the evolution of 
complex multicellularity in fungi, are currently 
debated with the accumulating evidence of mul- 
tiple losses demonstrated by the loss of associated 
essential genes (52). Thus, multiple gains and mul- 
tiple losses are not mutually exclusive scenarios to 
explain the evolution of complex traits such as 
NFN symbiosis. This also suggests that reduction,
similarly to the evolution of complexity, might be 
a major driver of the phenotypic diversity ob- 
served in extant organisms (36, 53). 

The fixation of loss-of-function alleles of NIN 
(either complete loss or pseudogenization) in 
nonnodulating species provides the genetic ex- 
planation for the loss of NFN symbiosis in 10 
species representing eight nonnodulating line- 
ages. Fixation of such alleles requires ecological 
conditions in which the cost of symbiotic 
nitrogen fixation (involving infection, building
nodules to host bacteria, and providing carbon
to feed them) outweighs the benefit to the plant.
In most terrestrial habitats, nitrogen is limiting 
(54), suggesting that the scale should be tipped 
toward the conservation of NFN symbiosis once 
this complex trait evolves. In nitrogen-rich hab- 
itats, NFN symbiosis is known to be inhibited in 
legumes (55). Long-term fertilizer application 
would make NFN symbiosis-specific genes super- 
fluous, leading to their eventual mutational in- 
activation and loss. In addition to this abiotic 
constraint, NFN symbiosis may be undermined 
by “cheating” bacteria that gain entry into root 
nodules and are fed by the plant but do not 
deliver nitrogen (56-58). Cheaters may there- 
fore imbalance the trade-off between the costs 
and benefits of the association, as already pro- 
posed on the basis of patterns of legume NFN 
symbiosis in Africa (59). This would result in 
the loss of NFN symbiosis being adaptive, thus 
providing an ecological explanation for the occurrence
of this symbiosis in only a few flowering
plant species. In this context, the finding that 
NIN participates in shaping the root microbiome 
beyond NFN symbiosis makes it a target of 
adaptive selection against this symbiosis (60). 

Engineering biological nitrogen fixation in 
crops remains a goal of plant synthetic biologists, 
with the aim of improving food production in 
developing countries in which the application of 
nitrogen fertilizer is limited by economic and 
infrastructural constraints. Our results supporting 
the occurrence of multiple independent losses 
indicate that the apparent selection against
NFN symbiosis must be taken into account by
projects whose aim is to improve legumes and, 
even more, when considering the engineering 
of nitrogen fixation in other crops. 


Materials and methods 
Plant material and sample preparation 


The origin of the plant material used for DNA or 
RNA extraction in this study is summarized in 
data S2. 

Methods for plant growth, DNA extractions, 
and RNA extractions are described in the sup- 
plementary materials online. 


Genome sequencing 


Figure S1 describes the overall strategy and 
results of the dataset production in this study. 

Whole-genome sequencing for the 10 genomes
was performed using Illumina sequencing technology
(HiSeq 2000 and HiSeq 4000) at BGI-Shenzhen.
A hierarchical library construction
strategy was applied that typically included
multiple paired-end libraries with insert sizes of 
170, 250, 350, 500, and 800 bp and mate-pair 
libraries with insert sizes of 2, 5, 10, and 20 kb. 
Most of the paired-end and mate-pair libraries 
were prepared from large genomic fragments, 
typically of size 20 to 40 kb, or even larger. For 
some species, more small-insert-size PE libraries 
were constructed to complement the limited 
mate-pair libraries. The library construction 
for each species is summarized in table S36. 
Deep genome sequencing was performed for most
of the species, with at least 110-fold coverage after
stringent data filtering; the highest sequencing
depth reached 535-fold in the cleaned data.

The overview statistics of data production are 
summarized in table S2 and fig. S1. 

To minimize sequencing errors and reduce 
genome assembly artifacts, several quality con- 
trol steps were taken to filter out low-quality 
sequencing reads: 

Removal of N-rich reads: Reads that contained 
more than 10% of “N” bases or a polyA structure
were removed. 

Removal of low-quality reads: Reads in which 
40% of the bases were low-quality (quality scores 
<7) were filtered out. 

Filtering of reads with ≥10 nt aligned to the
adapter sequences: Adapter sequences were
aligned to read1 and read2 using a dynamic
programming approach; if the aligned fragments
from read1 and read2 were reverse complementary
to each other, the pair was also removed.

Filtering of small-insert-size reads (insert
size 170 to 800 bp): Pairs in which read1 and
read2 overlapped by ≥10 bp, allowing 10% mismatch,
were filtered out.

Filtering of PCR duplicates: If both read1
and read2 of two read pairs were 100% identical, the pairs were
treated as duplicates and only one was retained.

Trimming of read ends: The low-quality bases 
from read ends (5'-5 bp, 3'-8 bp) were directly 
trimmed. 
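
Three of the filters listed above (N-rich reads, low-quality reads, and PCR duplicates) together with the end trimming can be sketched as follows. This is not the in-house Perl program: the function names are hypothetical, the polyA, adapter, and overlap filters are not modeled, and two readings are assumed where the text is ambiguous (a read is dropped once 40% or more of its bases have quality below 7, and trimming removes 5 bases from the 5' end and 8 bases from the 3' end).

```python
from typing import Iterable, List, Sequence, Tuple


def passes_quality_filters(seq: str, quals: Sequence[int],
                           max_n_fraction: float = 0.10,
                           low_quality_cutoff: int = 7,
                           max_low_quality_fraction: float = 0.40) -> bool:
    """Return True if a read survives the N-content and base-quality filters."""
    if not seq or len(seq) != len(quals):
        return False
    # Remove reads containing more than 10% N bases.
    if seq.upper().count("N") / len(seq) > max_n_fraction:
        return False
    # Assumption: the read is removed when the low-quality fraction reaches 40%.
    low_fraction = sum(q < low_quality_cutoff for q in quals) / len(quals)
    return low_fraction < max_low_quality_fraction


def trim_read_ends(seq: str, quals: Sequence[int],
                   five_prime: int = 5,
                   three_prime: int = 8) -> Tuple[str, List[int]]:
    """Trim a fixed number of bases from both ends of a read."""
    end = max(five_prime, len(seq) - three_prime)  # empty result for short reads
    return seq[five_prime:end], list(quals[five_prime:end])


def drop_duplicate_pairs(pairs: Iterable[Tuple[str, str]]) -> List[Tuple[str, str]]:
    """Keep the first occurrence of each identical (read1, read2) pair."""
    seen = set()
    unique = []
    for pair in pairs:
        if pair not in seen:
            seen.add(pair)
            unique.append(pair)
    return unique
```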

This filtering process was carried out using an 
in-house Perl program. After filtering, on average, 
150-fold sequencing coverage was generated. For 
each species, clean reads were then passed to the 
genome assembler pipeline for de novo genome 
assembly. 

The statistics of clean data are also summar- 
ized in table S2. 


Genome assembly 


To optimize the strategy for genome assembly, a
genome survey is necessary to estimate the genome
complexity. k-mer analysis can reveal whether a
genome is rich in repetitive content and/or maintains
a high heterozygosity rate.
A k-mer refers to a continuous sequence with k
base pairs, typically extracted from the reads 
(thus, shorter than the read length, e.g., 17 bases 
per k-mer). If an “ideal” sequencing dataset is 
produced from a random whole-genome shotgun
process without sequencing errors or coverage
bias, the start positions of reads along the genome
will follow a Poisson distribution (61). Supposing
that the read length is far shorter than the genome
size, the k-mers can be regarded as randomly
generated from the genome, and their occurrence
(sequencing depth) is also expected to be Poisson
distributed (fig. S3A). Based on this assumption, 
the genome size can be estimated as (62): 


$$\text{Genome size} = \frac{k\text{-mer number}}{\text{Average sequencing depth}}$$
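
As a worked illustration of this estimate, the sketch below applies the formula to a k-mer depth histogram. It rests on two assumptions not stated in the text: the average sequencing depth is approximated by the depth of the main peak, and k-mers below a hypothetical `min_depth` cutoff are discarded as likely sequencing errors. The function name is illustrative.

```python
from typing import Dict, Tuple


def estimate_genome_size(depth_histogram: Dict[int, int],
                         min_depth: int = 2) -> Tuple[int, float]:
    """Estimate genome size from a k-mer depth histogram.

    `depth_histogram` maps a depth to the number of distinct k-mers observed
    at that depth. Returns (peak depth, estimated genome size).
    """
    usable = {d: n for d, n in depth_histogram.items() if d >= min_depth}
    if not usable:
        raise ValueError("no k-mers at or above min_depth")
    # Assumption: the main peak stands in for the average sequencing depth.
    peak_depth = max(usable, key=usable.get)
    # k-mer number: total k-mer occurrences, i.e. depth times distinct count.
    kmer_number = sum(d * n for d, n in usable.items())
    return peak_depth, kmer_number / peak_depth
```

For a repeat-rich or highly heterozygous genome the most populated depth may not be the homozygous single-copy peak, so the chosen peak would need to be checked against the plotted distribution.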


For a “normal” diploid genome, the k-mer frequency
produced from adequate reads would
follow a Poisson distribution. For genomes that
are either repeat-rich or highly heterozygous,
an additional peak is expected next to the “main
peak” (indicative of the normal diploid genome)
in the frequency distribution: a peak at typically
twofold the depth of the main peak indicates
highly repetitive content, and a peak at typically
half the depth of the main peak indicates high
heterozygosity (63). Even more complex patterns
can arise from noncanonical genomic
characteristics (e.g., degree of heterozygosity,
complexity of the size and distribution of repetitive
content, etc.) coupled with the use of
different k-mer sizes.


K-mer statistics and distributions are 
presented in fig. S2 and tables S6 to S25. 


During de novo genome assembly, we tried dif- 
ferent k-mers (from 23- to 33-mer) to construct 
contigs, and the best k-mer (with the largest 
contig N50 length) was selected for the final run. 
Owing to differences in genome complexities be- 
tween species, multiple genome assemblers were 
applied to achieve the optimal assembly result. As 
described in fig. S1, SOAPdenovo2 (version 2.04)
(64) was the most frequently used assembler, and
Platanus (version 1.2.4) (65) was used for highly
heterozygous genomes. After several rounds of assembly
evaluations regarding contig contiguity and ge- 
nome completeness, the best assemblies (largest 
contig N50 and highest BUSCO gene mapping 
rate) were selected for the downstream gap- 
closing step by Gapcloser (version 1.2) (64). For 
the assembly of the Discaria trinervis genome, we
used the Celera assembler CA8.3rc1. Because the
Celera assembler (66) is sensitive to excessive 
coverage, library sizes were down-sampled to 
equal sized batches with a total coverage of ap- 
proximately 50-fold. In a subsequent step, the 
entire sequence information was used to generate 
scaffolds and close gaps with SSPACE (67) and
Gapcloser, respectively. 
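
The k-mer selection rule described above (choose the k giving the largest contig N50) can be sketched as follows. The function names and the input layout, a mapping from k to the contig lengths of a trial assembly, are hypothetical; the assemblers themselves are not invoked here.

```python
from typing import Dict, List


def n50(lengths: List[int]) -> int:
    """Length of the contig at which the cumulative length, summed from the
    longest contig downward, first reaches half of the total assembly length."""
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length
    return 0  # empty assembly


def best_kmer(contig_lengths_by_k: Dict[int, List[int]]) -> int:
    """Return the k whose trial assembly has the largest contig N50."""
    return max(contig_lengths_by_k, key=lambda k: n50(contig_lengths_by_k[k]))
```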

Table S3 indicates the assembly strategy for 
each genome. 

The statistics of genome assemblies are sum- 
marized in table S3, and the details for each 
species are presented in tables S6 to S25. 


Genome assembly evaluation 


The assembly evaluations for all of the genomes 
are provided in table S4. 

Mapping of the 1440 ultra-conserved
core eukaryotic genes from the BUSCO (68)
dataset recovered >90% of these genes for
most of the genome assemblies.
Taken together, these results indicate
good genome assembly qualities for most of the 
newly sequenced species in this study, especially 
with respect to the genic regions (Fig. 1). 


Genome annotation 

A schematic workflow for genome annotation 
is given in fig. S1. 

Repeat identification 


Identification of transposable elements (TEs) was 
carried out by RepeatMasker (version 4-0-5) (69). 
A custom repeat library was constructed for each
species by careful self-training. To construct the
custom repeat library, we first collected the
miniature inverted repeat transposable elements
(MITEs) from many closely-related species and 
created a lineage-specific custom library by MITE- 
hunter (70) with default parameters. For the 
prediction of long terminal repeats (LTRs), we
used LTRharvest (71) integrated in Genometools
(version 1.5.8) (72), defining LTR elements as 1.5 to
25 kb in length, with two terminal repeats ranging
from 100 to 6000 bp with ≥99% similarity. An
intact PPT (polypurine tract) or PBS (primer
binding site), identified by LTRdigest (71) using
a eukaryotic tRNA library (http://gtrnadb.ucsc.edu/),
was required to define an LTR element, whereas
elements without appropriate
PPT or PBS location were removed. To remove
false positives such as local gene clusters and 
tandem local repeats, 50-bp flanking sequences 
on both sides of the LTRs of each candidate ele- 
ment were aligned using MUSCLE (73) with de- 
fault parameters; if the identity was greater than 
or equal to 60%, the LTR element was considered 
as a false positive and removed. LTR elements 
nested with other inserted, but unrelated, com- 
ponents were also removed. Exemplars were built 
using a cutoff of 80% identity in 90% of element 
length from an all versus all BLASTn search. 
Terminal repeat retrotransposon in miniature 
(TRIM) libraries, with lengths of 70 to 500 kb, 
were built following a similar prediction strategy. 
Furthermore, the genomic sequence was masked 
to run RepeatModeler (version 1-0-8) (69) to ex- 
tensively de novo predict repetitive sequences 
for each species. The MITE, LTR, and TRIM re- 
petitive sequence libraries were integrated to- 
gether to make a complete and nonredundant 
custom library. This custom repeat library was 
taken as the input for RepeatMasker to identify 
and classify transposable elements genome-wide 
for each species. 
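
The LTR candidate filters described above can be sketched as a single predicate. This is a minimal illustration rather than the pipeline actually used: the `LTRCandidate` record and its field names are hypothetical, the similarity threshold is read here as "at least 99%", and the inputs are assumed to have been computed beforehand (for example, by LTRharvest, LTRdigest, and a MUSCLE alignment of the 50-bp flanks).

```python
from dataclasses import dataclass


@dataclass
class LTRCandidate:
    # All field names are hypothetical; values would come from upstream tools.
    element_length: int     # full element length in bp
    ltr_length: int         # length of each terminal repeat in bp
    ltr_similarity: float   # percent identity between the two terminal repeats
    has_ppt_or_pbs: bool    # an intact PPT or PBS was located
    flank_identity: float   # percent identity of the aligned 50-bp flanks


def passes_ltr_filters(c: LTRCandidate) -> bool:
    """Apply the length, similarity, PPT/PBS, and flank-identity filters."""
    if not 1_500 <= c.element_length <= 25_000:   # 1.5 to 25 kb
        return False
    if not 100 <= c.ltr_length <= 6_000:          # 100 to 6000 bp
        return False
    if c.ltr_similarity < 99.0:                   # assumed to mean >= 99%
        return False
    if not c.has_ppt_or_pbs:
        return False
    # Similar flanks suggest a gene cluster or tandem repeat, not a true LTR.
    if c.flank_identity >= 60.0:
        return False
    return True
```

Nested elements and exemplar selection are not covered by this sketch.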


Gene annotation 


Repeat elements were masked for each genome 
assembly before gene model prediction. Protein- 
coding genes were identified using the MAKER-P 
pipeline (version 2.31) (74) with two rounds of 
iterations. To obtain an optimal gene prediction, 
a series of trainings was performed. First, for ge- 
nomes that have RNA samples sequenced, a set 
of transcripts was generated by a genome-guided 
approach using Trinity and then mapped back to 
the genome using PASA (version 2.0.2) (75). This 
process generated a set of complete gene models
from each genome assembly, from which real gene
characteristics (size and number of
exons and introns per gene, distribution of genes,
features of splice sites, etc.) were obtained by Augustus (76).
Genemark-ES (version 4.21) (77) was self-trained 
with default parameters. SNAP (78) was trained 
using RNA- or protein-based gene models from 
the first iteration of MAKER-P pipeline. For RNA- 
seq aided gene annotation, RNA clean reads were 
assembled into inchworms using Trinity (79). For 
some species, transcriptome and ESTs data were 
obtained from NCBI or 1KP database if available 
(https://sites.google.com/a/ualberta.ca/onekp/).
An optimal core protein set was collected from 
several closely related species for homolog-based 
gene prediction for each species, for example, gene 
models from the model plants like Arabidopsis 
thaliana and Oryza sativa, as well as from some 
well-annotated legumes like Medicago and Glycine 
max. Default parameters were used to run 
MAKER-P with all integrated annotation sources 
and to produce the final set of gene models for 
each species. The number of gene models for 
each species is summarized in table S26, and 
detailed statistics are summarized in table S28. 
BUSCO evaluation suggests complete and reli- 
able gene annotation for all newly sequenced 
genomes (tables S4 and S5). 

HMMER-based engineer InterproScan (version 
5.11) (80) was used to predict gene function from 
several functional databases. The motifs and 
domains of genes were determined by searching 
against protein databases. An integrated gene 
functional annotation is summarized in table S29 
for all species. 


Transcriptome sequencing 


RNA samples were sequenced for seven species 
to assist gene prediction in this study. The over- 
view of total RNA samples with various tissues is 
summarized in table S37. For each RNA sample,
a paired-end library with insert size of ~200 bp was
constructed following the manufacturer's protocol.
Libraries were barcoded and pooled together 
as input to the Illumina Hiseq 4000 platform 
for sequencing. All of the RNA samples were 
sequenced in-depth, with an average of 6-Gb se- 
quences per sample, to ensure a complete co- 
verage for each transcriptome. 


Genome-wide comparative
phylogenomic analysis

Clustering of gene families 

Clustering of gene families is based on predicted 
proteomes of the gene annotation of 37 species 
(table S1). To generate a set of nonredundant
representative sequences, we removed multiple
isoforms of a gene by applying cd-hit clustering
with an identity threshold of 99.5% (81). Sub-
sequently, homologs were identified with an all
versus all blastp (v.2.2.30+) search of the 37
species proteomes. For each query-subject pair,
we summed up the aligned sequence of all its 
HSPs (blast high scoring pair), ignoring overlaps, 
and compared it with the sequence length of 
both subject and query. We removed all query- 
subject-pairs from the blast tables for which the 
alignment coverage was less than 40% of either 
the total query or subject sequence length. Accord- 
ing to Yang and Smith (82), this hit fraction filter 
step with the used cutoff of 40% substantially 
improves phylogenetic trees and orthology infer- 
ence in the subsequent steps. On the basis of these 
modified blast tables, we clustered the remain- 
ing homologs to gene families with OrthoFinder 
(inflation parameter of 1.3) (20). The result- 
ing 29,433 gene families were used as a start- 
ing point for the genome-wide phylogeny-based 
ortholog presence-absence analysis and candidate 
confirmation. We also calculated gene family
clusters without applying the 40% hit fraction 
filter and used these 23,869 gene family clusters 
for the analysis of gene copy-number variation. 
Figure S3 provides an overview both for the 
individual steps and the complete pipeline. 
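
The hit fraction filter can be illustrated with a short sketch. It assumes that HSP coordinates have already been parsed from a tabular BLAST report into tuples, and it interprets "ignoring overlaps" as counting residues covered by several HSPs only once; both points, along with the function names, are assumptions made for illustration and may differ from the scripts actually used.

```python
from collections import defaultdict


def merged_length(intervals):
    """Length covered by 1-based inclusive intervals, counting overlaps once."""
    total = 0
    cur_start = cur_end = None
    for start, end in sorted(intervals):
        if cur_end is None or start > cur_end:
            if cur_end is not None:
                total += cur_end - cur_start + 1
            cur_start, cur_end = start, end
        else:
            cur_end = max(cur_end, end)
    if cur_end is not None:
        total += cur_end - cur_start + 1
    return total


def hit_fraction_filter(hsps, lengths, min_fraction=0.4):
    """Keep query-subject pairs whose HSPs cover >= min_fraction of both.

    hsps: iterable of (query, subject, qstart, qend, sstart, send).
    lengths: dict mapping sequence id to protein length.
    """
    q_iv, s_iv = defaultdict(list), defaultdict(list)
    for q, s, qs, qe, ss, se in hsps:
        q_iv[(q, s)].append((min(qs, qe), max(qs, qe)))
        s_iv[(q, s)].append((min(ss, se), max(ss, se)))
    kept = set()
    for (q, s), q_intervals in q_iv.items():
        q_cov = merged_length(q_intervals) / lengths[q]
        s_cov = merged_length(s_iv[(q, s)]) / lengths[s]
        if q_cov >= min_fraction and s_cov >= min_fraction:
            kept.add((q, s))
    return kept
```

Pairs that pass would then be written back to the BLAST table that is handed to the clustering step.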


Genome-wide phylogeny-based ortholog 
presence-absence analysis 


For each OrthoFinder gene family cluster, a 
separate phylogeny was calculated with FastME 
(83). As suggested by Yang and Smith (82), un- 
reliable or wrongly resolved super-long branches 
were removed from each family tree in the fol- 
lowing way: For each tree, average length and 
standard deviation of terminal branches were 
calculated, and branches longer than the mean 
of the average terminal branch length plus 
threefold standard deviation were removed. A 
Python script was written to root the pruned trees
with the farthest-oldest outgroup method [imple-
mented in the Python package ETE 3 (84)], and
ortholog-paralog relationships were inferred with 
the species overlap algorithm (implemented in 
the Python package ETE 3) (84). We then searched 
lists of orthologs of each gene of the reference 
species, the well-characterized nodulating legume 
Medicago, in total 29,213 ortholog lists, for the 
presence or absence of orthologs in all remaining 
36 species of our dataset. The following criteria 
were applied to identify candidates: 

1) orthologs present in at least 66% of the 
nodulating species (10 of 15), 

2) orthologs present in a fraction of nodulat- 
ing species from each order of the NFN clade (at 
least 7 of 10 nodulating Fabales, at least 1 of 2 
nodulating Rosales, at least 1 of 2 nodulating 
Fagales, 1 of 1 nodulating Cucurbitales), 

3) (only for the predisposition hypothesis)
orthologs absent from ALL outgroup species 
outside of the NFN clade, and 

4) (only for the multiple-losses hypothesis) 
orthologs absent from more than 50% of the 
nonnodulating species (at least 7 of 13). 
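
The four criteria can be expressed as one presence-absence test per ortholog list. The sketch below is illustrative only: the function signature and the way species sets are passed in are assumptions, and the thresholds are copied from the criteria listed above.

```python
# Minimum number of nodulating species with an ortholog, per order (criterion 2).
MIN_NODULATING_BY_ORDER = {
    "Fabales": 7,
    "Rosales": 1,
    "Fagales": 1,
    "Cucurbitales": 1,
}


def is_candidate(present, nodulating_by_order, nonnodulating, outgroups,
                 hypothesis):
    """Return True if an ortholog presence pattern meets the criteria.

    present: set of species in which an ortholog was found.
    nodulating_by_order: dict mapping order name to a set of nodulating species.
    nonnodulating: set of nonnodulating species within the NFN clade.
    outgroups: set of species outside the NFN clade.
    hypothesis: "predisposition" or "multiple-losses".
    """
    nodulating = set().union(*nodulating_by_order.values())
    # Criterion 1: present in at least 10 of the 15 nodulating species.
    if len(present & nodulating) < 10:
        return False
    # Criterion 2: minimum representation within each order.
    for order, minimum in MIN_NODULATING_BY_ORDER.items():
        if len(present & nodulating_by_order[order]) < minimum:
            return False
    if hypothesis == "predisposition":
        # Criterion 3: absent from all outgroup species.
        return not (present & outgroups)
    if hypothesis == "multiple-losses":
        # Criterion 4: absent from at least 7 of the 13 nonnodulating species.
        return len(nonnodulating - present) >= 7
    raise ValueError(f"unknown hypothesis: {hypothesis}")
```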

Instead of filtering for the presence of orthologs 
of all nodulating species (10/10 Fabales, 2/2 
Rosales, 2/2 Fagales, and 1/1 Cucurbitales), we 
used relatively relaxed criteria (as defined in 
1 and 2) for nodulating species to avoid missed
potential candidates owing to erroneous gene 
annotations (e.g., false negatives from gene an- 
notation pipelines). Each of the candidates 
resulting from these criteria (predisposition 
hypothesis: 31 candidates; multiple-losses hy- 
pothesis: 121 candidates) underwent a refined 
candidate analysis described below with stricter 
criteria for nodulating species. 
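
The long-branch filter applied to each family tree earlier in this section (terminal branches longer than the mean plus threefold standard deviation) can be sketched as follows. The input is assumed to be a mapping from tip name to terminal branch length extracted from a tree; the use of the sample standard deviation is likewise an assumption, because the text does not state which estimator was used.

```python
from statistics import mean, stdev


def long_branch_tips(terminal_lengths, n_sd=3.0):
    """Return tips whose terminal branch exceeds mean + n_sd * SD.

    terminal_lengths: dict mapping tip name to terminal branch length.
    """
    lengths = list(terminal_lengths.values())
    if len(lengths) < 2:
        return set()  # no spread can be estimated from fewer than two tips
    cutoff = mean(lengths) + n_sd * stdev(lengths)
    return {tip for tip, length in terminal_lengths.items() if length > cutoff}
```

Note that in small trees no tip can exceed a three-SD cutoff, so such a filter only has an effect on families with enough members.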

Besides the candidates of the genome-wide 
presence-absence analysis of phylogeny-based 
orthologs, we collected 22 candidate genes (table 
S33) that have been reported to be involved in
root nodule symbiosis mostly from the model 
legume organisms Medicago and L. japonicus. 
For each OrthoFinder gene family cluster con-
taining one of these genes, we calculated max-
imum likelihood gene family trees (RAxML v.8.2.4;
model: CATWAG). On the basis of the topology
of the gene family tree and its protein alignment
[MAFFT v.7.222 L-INS-i (85); trimming: BMGE
gap-rate cutoff 80% (86)], we manually selected
the subtree that contained the gene of interest
(orthogroup). Subsequently we realigned (MAFFT
v.7.222 L-INS-i; trimming: BMGE gap-rate cutoff
20%) and recalculated the phylogenetic tree
(RAxML v.8.2.4; model: GAMMAJTT; 200 boot-
straps) (87) of the orthologous group to improve
the quality of the subtree. Trees were rooted 
manually. Starting from the gene of interest and 
traversing to the root of the tree, we marked all 
nodes as duplication or speciation events. If the 
two subclades of a node shared genes that were 
originating from the same species, this node was 
interpreted as a duplication, otherwise as a 
speciation event. On the basis of speciation 
nodes, we inferred the orthologs of the gene of 
interest. At duplication nodes, all genes in the 
subclade lacking the query gene were inferred 
as paralogs. In the case of a speciation node, all 
genes belonging to both subclades of that par- 
ticular node were inferred as orthologs of the 
query gene unless they were annotated as par- 
alogs at a previous node. All orthologs with 
incomplete gene models were removed as long 
as they had paralogs among the species they 
were derived from, keeping at least one ortholog 
per species. We defined gene models as in- 
complete or fragmented if more than 20% of 
the conserved amino acid sequence was absent. 
As conserved amino acid sequences, we used the 
trimmed alignments (BMGE 20% gap-rate cutoff). 
To avoid false conclusions for missing orthologs, 
we retested a potential absence of genes by a 
homolog search. In such a case and in the case 
of still-incomplete gene models, we searched the 
complete genome sequence of the corresponding 
species (tblastn v2.2.30+, default parameters) 
with the closest homolog from the gene tree for 
regions containing potential gene loci of putative 
orthologs. These regions were then used to pre- 
dict gene models (fgenesh+) (88). The resulting 
gene models were included in the set of se- 
quences of the orthologous group, and another 
round of alignment and tree calculation was 
performed (same settings and tools as last round). 
These identified sequences complementing the 
species gene annotation are provided as a se- 
parate fasta-formatted file in data S1. The result- 
ing tree was then used for a final round of ortholog 
inference so that the final set of orthologous genes 
is the result of an iterative process with constant 
improvement of gene models and phylogenies. If 
a complete gene model was not detected, the 
fragmented model was used and the ortholog 
was annotated as “fragmented” for the respective 
species. Fragmented models were only annotated 
as complete if the different fragments merged to 
a complete model and the fragmentation could 
be explained by the fragmentation of genomic 
scaffolds. 
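
The duplication and speciation rule used for the manually curated trees can be illustrated on a toy rooted tree. In this sketch a tree is a nested pair of subtrees, a leaf is a string of the form "Species|gene", and the function names are hypothetical; it is a simplified stand-in for the procedure, not the code used in the study.

```python
def leaves(node):
    """Return all leaf labels below a node."""
    if isinstance(node, str):
        return [node]
    left, right = node
    return leaves(left) + leaves(right)


def species_of(leaf):
    """Extract the species prefix from a 'Species|gene' label."""
    return leaf.split("|")[0]


def infer_orthologs(tree, query):
    """Classify genes as orthologs or paralogs of `query`.

    Nodes on the path from the query leaf to the root are visited from the
    leaf upward. A node is a duplication if its two subclades share a
    species, otherwise a speciation.
    """
    orthologs, paralogs = set(), set()

    def walk(node):
        # Returns True if the query gene lies within this node.
        if isinstance(node, str):
            return node == query
        left, right = node
        in_left, in_right = walk(left), walk(right)
        if not (in_left or in_right):
            return False
        with_query, sister = (left, right) if in_left else (right, left)
        sister_leaves = leaves(sister)
        shared = ({species_of(x) for x in leaves(with_query)}
                  & {species_of(x) for x in sister_leaves})
        if shared:
            paralogs.update(sister_leaves)    # duplication node
        else:
            orthologs.update(sister_leaves)   # speciation node
        return True

    walk(tree)
    return orthologs, paralogs
```

Genes already marked as paralogs at a deeper node stay paralogs, because only the subclade lacking the query is classified at each step.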

For the prefiltered candidates from the 
genome-wide phylogeny-based ortholog presence- 
absence analysis, we used stricter criteria than 
in the fully automated approach that led to the 
prefiltered candidates. We only kept such can-
didates from the genome-wide phylogeny-based
automated presence-absence pipeline for which
orthologs of all nodulating species were present
and orthologs in more than 50% of nonnodulating
species (at least 7 of 13) were absent. Orthologs absent
from the OrthoFinder output were independently
searched by microsynteny to exclude the possibility 
that the ortholog could not be found because the 
syntenic region of the ortholog was not in the 
corresponding genome assembly (for more detail 
see the “synteny analysis” section). The presence, 
absence, and fragmentation of orthologs for each 
of the 22 selected known symbiosis genes and 
each species is summarized in Fig. 3. The diver- 
sification times shown in the chronogram are 
based on estimates from Bell et al. (Outgroup, 
BEAST, 36 minimum age constraints treated as 
log-normal distributions) (89), Xi et al. (Mal- 
pighiales, BEAST, uncorrelated lognormal model) 
(90, 91), and Li et al. (NFN clade, r8s, 1008-taxa
tree) (13). Phylogenetic trees for all candidate
genes are provided in figs. S4 and S6 to S25.


Analysis of gene copy-number variation 


OrthoFinder gene family clusters were used to 
identify nodes in the species tree of our dataset, 
where gene family expansions or contractions in 
fast-evolving gene families occurred. The number 
of genes for each species of each cluster were 
counted and analyzed with the software tool CAFE 
(92). Following instructions given in the CAFE 
manual, we removed gene family clusters with 
strong outliers in gene copy number. Therefore, 
we excluded 193 gene family clusters from the 
analysis for which the difference between max- 
imum copy number and median copy number 
was greater than or equal to 50 copies, which 
meets the elbow criterion to identify the optimal 
number of clusters in a clustering problem 
(fig. S27). To avoid overestimation of gene family 
contractions, we only used gene family clusters 
that contained orthologs from at least 28 species. 
This enabled us to analyze gene families that lost 
all genes in zero up to nine species. We chose this 
cut-off because of the nine outgroup species in 
the dataset. In the extreme case that all these 
nine outgroups have a gene count of zero, we 
could still analyze gene families originating from 
the last common ancestor of the NFN clade. After 
applying this cut-off, a total of 10,237 gene 
families were kept for the gene family evolution 
analysis (automatic λ and μ estimation; signif-
icance level for fast-evolving families, 5%). The
results of the analysis are shown in Fig. 2. 
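
The two filters applied before the CAFE analysis can be written compactly. The sketch assumes each gene family is available as a mapping from species name to gene copy number across all 37 species; the function name and default arguments are illustrative.

```python
from statistics import median


def keep_family(counts, max_minus_median=50, min_species=28):
    """Decide whether a gene family enters the copy-number analysis.

    counts: dict mapping species name to gene copy number (zeros included).
    """
    values = list(counts.values())
    # Outlier filter: drop families where max - median is at least 50 copies.
    if max(values) - median(values) >= max_minus_median:
        return False
    # Coverage filter: require genes in at least 28 species.
    return sum(1 for v in values if v > 0) >= min_species
```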
From the Medicago Gene Expression Atlas, we 
obtained all differentially expressed probesets 
that were either up-regulated with a fold change 
of 2 or down-regulated with a fold change of 0.5 
in root nodule tissue of different age (7, 10, 14, 
and 28 dpi) compared to untreated root tissue. 
We associated all of these 18,131 regulated pro- 
besets to 17,521 Medicago v4.0 gene IDs using the 
mapping file provided by MtGEA. For each of the 
14 hypothesized independent gain-of-NFN sym- 
biosis nodes (Fig. 2, blue boxes), we extracted all 
gene family clusters that showed expansions at 
these nodes. For each of these clusters, we counted 
the number of transcriptionally regulated and not
regulated Medicago genes, ignoring all genes 
that could not be mapped to probesets. To test 
whether a gene family cluster was enriched for 
Medicago genes differentially expressed in nodulat- 
ing versus mock control roots, we performed 
Fisher’s exact test on a significance level of 5%. 
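
A one-sided Fisher's exact test for enrichment can be computed directly from the hypergeometric distribution. The layout of the 2 x 2 table below (genes inside the cluster versus all other mapped genes) and the choice of a one-sided test are assumptions made for this sketch; the text does not specify the background set or sidedness.

```python
from math import comb


def fisher_enrichment_p(reg_in, notreg_in, reg_out, notreg_out):
    """One-sided Fisher's exact P value for enrichment of regulated genes.

    reg_in / notreg_in: regulated and nonregulated genes inside the cluster.
    reg_out / notreg_out: the same counts for genes outside the cluster.
    """
    n_in = reg_in + notreg_in
    n_reg = reg_in + reg_out
    total = n_in + reg_out + notreg_out
    denom = comb(total, n_in)
    upper = min(n_in, n_reg)
    # Sum hypergeometric probabilities for tables at least as extreme.
    numer = sum(
        comb(n_reg, k) * comb(total - n_reg, n_in - k)
        for k in range(reg_in, upper + 1)
    )
    return numer / denom
```

A cluster would be called enriched at the 5% level when the returned value is below 0.05.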


Synteny analysis 

Genome-wide syntenic and collinear blocks were 
identified across the 37 selected genomes in this 
study. First, all versus all Blastp (E-value < 1e?°) 
was performed on the translated protein sequences 
of the 37 sets of annotated gene models, resulting 
in a database of protein similarity. We then used 
the Multiple Collinearity Scan (Mcscan toolkit 
version 1.1, 2016, >5 homologous gene pairs/block) 
to identify conserved collinear blocks between the 
37 genomes, creating a syntenic or collinear block 
database across all of the 37 species. To find all of 
the homologous syntenic blocks of interest, we 
first used the Medicago genome as a reference, search-
ing for and locating the target genes along the
syntenic blocks, together with the flanking genes in the
100-kb genomic regions upstream and downstream,
as well as the counterparts from different genomes.
For any other given genome, the optimal collinear 
block (the highest score if multiple duplicated 
blocks were found) was defined according to 
conservation of gene content (the largest num- 
ber of orthologous gene pairs) and consistency of 
gene order. If a corresponding ortholog was 
present in the collinear block from other aligned 
genomes, this was called scenario-1 (indicating 
that synteny supports gene presence and con- 
sistent with the genome-wide gene family ortholog 
prediction); if the target orthologous gene was 
absent from the collinear blocks of the given ge- 
nome, this was called scenario-2 (synteny supports 
gene absence). After this first round was finished, 
to complete any missing genes or blocks owing 
to weak alignment signals, we classified these 
identified orthologous genes as well as the cor- 
responding collinear blocks and repeated the 
searching and locating process between genomes 
according to their evolutionary proximity in 
phylogenetic position (using the most closely 
related genome as query). By this process, we 
updated scenario-1 and scenario-2 as described 
above. In addition, for some species, no collinear 
block was identified around the possible target 
gene, and we manually revisited the protein sim- 
ilarity database and searched the genomic regions 
flanking the Medicago gene to confirm the gene 
presence or absence supported by synteny. Finally, 
if no synteny was identified, but the candidate 
ortholog gene was predicted from the genome-
wide gene family analysis, we called this
scenario-3 (gene presence without synteny sup- 
port, which indicates possible gene transloca- 
tion and synteny erosion). 
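
The three scenarios amount to a small decision rule. The sketch below assumes that the outcome of the collinear block search and of the gene family analysis are already available as booleans; the function and argument names are hypothetical.

```python
from typing import Optional


def synteny_scenario(collinear_block_found: bool,
                     ortholog_in_block: bool,
                     ortholog_in_gene_family: bool) -> Optional[str]:
    """Assign scenario-1, -2, or -3 for one target gene in one genome."""
    if collinear_block_found:
        # Synteny supports either presence or absence of the gene.
        return "scenario-1" if ortholog_in_block else "scenario-2"
    if ortholog_in_gene_family:
        # Gene predicted by the gene family analysis but without synteny.
        return "scenario-3"
    return None  # no synteny and no predicted ortholog: not classified here
```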


Detecting selection pressure 
on gene trees 


For NIN and RPG, protein sequences were aligned 
using MAFFT v7.380 (85). The protein alignment 
served as matrix for codon alignment performed 
using the Perl script pal2nal v14 with the -nogap
option enabled to remove all gapped positions. 
Codon alignments were then subjected to a maxi- 
mum likelihood analysis using IQ-TREE v1.6.1 (93) 
with 10,000 Ultrafast bootstraps replicates (94). 
The best-fitted evolutionary model was previously 
investigated using ModelFinder (95). The unrooted 
tree obtained was controlled to fit with the evo- 
lutionary frame of species, and the NFN clade 
labeled as the foreground branch that was tested 
for being under positive selection. The latter was 
investigated using the branch-site model A imple- 
mented in the codeml module from the PAML 
package v4.9g (49). An alternative hypothesis
(NFN clade may have a proportion of sites under
positive selection) was compared to the null hy-
pothesis (NFN clade may have a different propor-
tion of sites under neutral selection compared to
the other clades). For the null model, the param- 
eters were set as follows: “model = 2, NSites = 2, 
fix_kappa = 0, fix_omega = 1 and omega = 1,” 
whereas the parameters for the alternative mod- 
el were: “model = 2, NSites = 2, fix_kappa = 0, 
fix_omega = 0 and omega = 1.5”. The two hypothe- 
ses were compared using the likelihood ratio 
test based on a χ² distribution with one degree
of freedom. If the alternative hypothesis was 
validated, codon sites likely to fall under positive 
selection were identified using the Bayes Empir- 
ical Bayes procedure (49). 
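
The likelihood ratio test can be reproduced from the two log-likelihoods reported by codeml. The sketch uses the closed-form survival function of the χ² distribution with one degree of freedom; the function name and the example values in the usage note are illustrative and are not results from this study.

```python
from math import erfc, sqrt


def lrt_pvalue_1df(lnl_null, lnl_alt):
    """Return (test statistic, P value) for a 1-df likelihood ratio test.

    lnl_null: log-likelihood of the null model (omega fixed at 1).
    lnl_alt: log-likelihood of the alternative model (omega estimated).
    """
    stat = 2.0 * (lnl_alt - lnl_null)
    if stat <= 0:
        return 0.0, 1.0  # the alternative does not fit better
    # For one degree of freedom, P(chi2 > x) = erfc(sqrt(x / 2)).
    return stat, erfc(sqrt(stat / 2.0))
```

For example, a log-likelihood gain of about 1.92 units corresponds to a test statistic of about 3.84, which is the conventional 5% threshold for one degree of freedom.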


PCR validation of the absence of NIN 
in nonnodulating legumes 


A nested PCR approach was undertaken using 
primers designed on an alignment of NIN gDNA 
from Medicago and M. pudica. These primers are 
designed to amplify ~120 bp on Exon 4, ~170 bp 
on Exon 5, and the intron in between. The size of 
this intron ranges from 133 bp in Medicago to 397 
bp in M. pudica. For the first PCR, we used the 
degenerate primer pair NIN-Fwd-3 5'-GGAGAAAGTCMGGCGASAA
and NIN-Rev-3 5'-GRAARCTGGCATAGAATGA. The nested PCR was
run with 0.2 µl of the PCR reaction and primers
NIN-Fwd-2 5'-CGAACCAAGGCTGAGAAGAC and
NIN-Rev-2 5'-ATCTGTATGGCACCCTCTGC. The first
PCR run was 94°C 30 s, 45°C 1 min, 72°C 1 min for 
35 cycles, and the second PCR run was 94°C 30 s, 
50°C 1 min, 72°C 1 min for 35 cycles. For all 
species, the PCR was run on >2 samples. In ad- 
dition, a PCR on the 28S was run to confirm the 
quality of the samples. All PCRs were run using a 
GoTaq DNA polymerase. 
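
The degenerate primers of the first PCR contain IUPAC ambiguity codes (M, S, and R). The sketch below expands each primer into the concrete sequences it represents, which can help when checking primer binding in silico. The primer strings are transcribed from the text with the line-break hyphens removed, which is an assumption about the original typesetting; the helper name is hypothetical.

```python
from itertools import product

# Standard IUPAC nucleotide ambiguity codes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}

NIN_FWD_3 = "GGAGAAAGTCMGGCGASAA"
NIN_REV_3 = "GRAARCTGGCATAGAATGA"


def expand_degenerate(primer):
    """List every unambiguous sequence encoded by a degenerate primer."""
    options = (IUPAC[base] for base in primer.upper())
    return ["".join(combo) for combo in product(*options)]


if __name__ == "__main__":
    for name, seq in (("NIN-Fwd-3", NIN_FWD_3), ("NIN-Rev-3", NIN_REV_3)):
        print(name, expand_degenerate(seq))
```

Each of the two primers carries two twofold-degenerate positions and therefore expands to four sequences.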


REFERENCES AND NOTES 


1. P. C. Dos Santos, Z. Fang, S. W. Mason, J. C. Setubal, R. Dixon,
Distribution of nitrogen fixation and nitrogenase-like sequences
amongst microbial genomes. BMC Genomics 13, 162 (2012).
doi: 10.1186/1471-2164-13-162; pmid: 22554235
2. P. H. Beatty, A. G. Good, Future prospects for cereals that fix
nitrogen. Science 333, 416-417 (2011). doi: 10.1126/
science.1209467; pmid: 21778391
3. L. Hiltner, Über die Bedeutung der Wurzelknöllchen von Alnus
glutinosa für die Stickstoffernährung dieser Pflanze.
Landw. Versuchsstat 46, 153-161 (1895).
4. H. Hellriegel, H. Wilfarth, Untersuchungen über die
Stickstoffnahrung der Gramineen und Leguminosen
(Buchdruckerei der Post Kayssler, Berlin, 1888).
5. D. E. Soltis et al., Chloroplast gene sequence data suggest
a single origin of the predisposition for symbiotic nitrogen
fixation in angiosperms. Proc. Natl. Acad. Sci. U.S.A.
92, 2647-2651 (1995). doi: 10.1073/pnas.92.7.2647;
pmid: 7708699
6. C. Kistner, M. Parniske, Evolution of signal transduction
in intracellular symbiosis. Trends Plant Sci. 7, 511-518
(2002). doi: 10.1016/S1360-1385(02)02356-7;
pmid: 12417152
7. J. J. Doyle, Phylogenetic perspectives on the origins of
nodulation. Mol. Plant Microbe Interact. 24, 1289-1295 (2011).
doi: 10.1094/MPMI-05-11-0114; pmid: 21995796
8. J. I. Sprent, J. Ardley, E. K. James, Biogeography of nodulated
legumes and their nitrogen-fixing symbionts. New Phytol. 215,
40-56 (2017). doi: 10.1111/nph.14474; pmid: 28211601
9. K. Pawlowski, K. N. Demchenko, The diversity of actinorhizal
symbiosis. Protoplasma 249, 967-979 (2012). doi: 10.1007/
s00709-012-0388-4; pmid: 22398987
10. S. M. Swensen, The evolution of actinorhizal symbioses:
Evidence for multiple origins of the symbiotic association.
Am. J. Bot. 83, 1503-1512 (1996). doi: 10.1002/
j.1537-2197.1996.tb13943.x
11. G. D. A. Werner, W. K. Cornwell, J. I. Sprent, J. Kattge,
E. T. Kiers, A single evolutionary innovation drives
the deep evolution of symbiotic N2-fixation in angiosperms.
Nat. Commun. 5, 4087 (2014). doi: 10.1038/ncomms5087;
pmid: 24912610
12. S. M. Swensen, D. R. Benson, in Nitrogen-fixing Actinorhizal
Symbioses, K. Pawlowski, W. E. Newton, Eds. (Springer
Netherlands, 2008), pp. 73-104.
13. H. L. Li et al., Large-scale phylogenetic analyses reveal
multiple gains of actinorhizal nitrogen-fixing symbioses in
angiosperms associated with climate change. Sci. Rep. 5, 14023
(2015). doi: 10.1038/srep14023; pmid: 26354898
14. J. A. Eisen, C. M. Fraser, Phylogenomics: Intersection
of evolution and genomics. Science 300, 1706-1707 (2003).
doi: 10.1126/science.1086292; pmid: 12805538
15. P. M. Delaux, G. Radhakrishnan, G. Oldroyd, Tracing the
evolutionary path to nitrogen-fixing crops. Curr. Opin. Plant
Biol. 26, 95-99 (2015). doi: 10.1016/j.pbi.2015.06.003;
pmid: 26123396
16. Materials and methods are available as supplementary
materials.
17. N. D. Young et al., The Medicago genome provides insight into
the evolution of rhizobial symbioses. Nature 480, 520-524
(2011). doi: 10.1038/nature10625; pmid: 22089132
18. J. Schmutz et al., A reference genome for common bean and
genome-wide analysis of dual domestications. Nat. Genet. 46,
707-713 (2014). doi: 10.1038/ng.3008; pmid: 24908249
19. J. Schmutz et al., Genome sequence of the palaeopolyploid
soybean. Nature 463, 178-183 (2010). doi: 10.1038/
nature08670; pmid: 20075913
20. D. M. Emms, S. Kelly, OrthoFinder: Solving fundamental biases
in whole genome comparisons dramatically improves
orthogroup inference accuracy. Genome Biol. 16, 157 (2015).
doi: 10.1186/s13059-015-0721-2; pmid: 26243257
21. D. L. Stern, The genetic causes of convergent evolution.
Nat. Rev. Genet. 14, 751-764 (2013). doi: 10.1038/nrg3483;
pmid: 24105273
22. N. Shubin, C. Tabin, S. Carroll, Deep homology and the
origins of evolutionary novelty. Nature 457, 818-823 (2009).
doi: 10.1038/nature07891; pmid: 19212399
23. J. J. Doyle, Chasing unicorns: Nodulation origins and the
paradox of novelty. Am. J. Bot. 103, 1865-1868 (2016).
doi: 10.3732/ajb.1600260; pmid: 27756731
24. J. F. Marsh et al., Medicago truncatula NIN is essential for
rhizobial-independent nodule organogenesis induced by
autoactive calcium/calmodulin-dependent protein kinase. Plant
Physiol. 144, 324-335 (2007). doi: 10.1104/pp.106.093021;
pmid: 17369436
25. F. Clavijo et al., The Casuarina NIN gene is transcriptionally
activated throughout Frankia root infection as well as in
response to bacterial diffusible signals. New Phytol. 208,
887-903 (2015). doi: 10.1111/nph.13506; pmid: 26096779
26. H. Gherbi et al., SymRK defines a common genetic basis for
plant root endosymbioses with arbuscular mycorrhiza fungi,
rhizobia, and Frankia bacteria. Proc. Natl. Acad. Sci. U.S.A.
105, 4928-4932 (2008). doi: 10.1073/pnas.0710618105;
pmid: 18316735
27. L. Schauser, A. Roussis, J. Stiller, J. Stougaard, A plant
regulator controlling development of symbiotic root
nodules. Nature 402, 191-195 (1999). doi: 10.1038/46058;
pmid: 10647012
28. A. van Zeijl et al., CRISPR/Cas9-mediated mutagenesis of four
putative symbiosis genes of the tropical tree Parasponia
andersonii reveals novel phenotypes. Front. Plant Sci. 9, 284
(2018). doi: 10.3389/fpls.2018.00284; pmid: 29559988

Griesmann et al., Science 361, eaat1743 (2018)    13 July 2018

29. S. Svistoonoff et al., The independent acquisition of plant root
nitrogen-fixing symbiosis in Fabids recruited the same genetic
pathway for nodule organogenesis. PLOS ONE 8, e64515
(2013). doi: 10.1371/journal.pone.0064515; pmid: 23741336
30. K. Markmann, G. Giczey, M. Parniske, Functional adaptation of
a plant receptor-kinase paved the way for the evolution of
intracellular root symbioses with bacteria. PLOS Biol. 6, e68
(2008). doi: 10.1371/journal.pbio.0060068; pmid: 18318603
31. L. G. Nagy et al., Latent homology and convergent regulatory
evolution underlies the repeated emergence of yeasts.
Nat. Commun. 5, 4471 (2014). doi: 10.1038/ncomms5471;
pmid: 25034666
32. C. E. Conn et al., Convergent evolution of strigolactone
perception enabled host detection in parasitic plants.
Science 349, 540-543 (2015). doi: 10.1126/science.aab1140;
pmid: 26228149
33. J. He et al., The Medicago truncatula gene expression atlas web
server. BMC Bioinformatics 10, 441 (2009). doi: 10.1186/
1471-2105-10-441; pmid: 20028527
34. G. Criscuolo, V. T. Valkov, A. Parlati, L. M. Alves,
M. Chiurazzi, Molecular characterization of the Lotus japonicus
NRT1(PTR) and NRT2 families. Plant Cell Environ. 35,
1567-1581 (2012). doi: 10.1111/j.1365-3040.2012.02510.x;
pmid: 22458810
35. L. Aravind, H. Watanabe, D. J. Lipman, E. V. Koonin, Lineage-
specific loss and divergence of functionally linked genes in
eukaryotes. Proc. Natl. Acad. Sci. U.S.A. 97, 11319-11324
(2000). doi: 10.1073/pnas.200346997; pmid: 11016957
36. R. Albalat, C. Cañestro, Evolution by gene loss.
Nat. Rev. Genet. 17, 379-391 (2016). doi: 10.1038/nrg.2016.39;
pmid: 27087500
37. P. M. Delaux et al., Comparative phylogenomics uncovers the
impact of symbiotic associations on host genome evolution.
PLOS Genet. 10, e1004487 (2014). doi: 10.1371/
journal.pgen.1004487; pmid: 25032823
38. A. Y. Borisov et al., The Sym35 gene required for root nodule
development in pea is an ortholog of Nin from Lotus japonicus.
Plant Physiol. 131, 1009-1017 (2003). doi: 10.1104/
pp.102.016071; pmid: 12644653
39. N. Azani et al., A new subfamily classification of the
Leguminosae based on a taxonomically comprehensive
phylogeny - The Legume Phylogeny Working Group (LPWG).
Taxon 66, 44-77 (2017). doi: 10.12705/661.3
40. J. I. Sprent, Evolving ideas of legume evolution and diversity:
A taxonomic perspective on the occurrence of nodulation.
New Phytol. 174, 11-25 (2007). doi: 10.1111/
j.1469-8137.2007.02015.x; pmid: 17335493
41. J.-F. Arrighi et al., The RPG gene of Medicago truncatula
controls Rhizobium-directed polar growth during infection.
Proc. Natl. Acad. Sci. U.S.A. 105, 9817-9822 (2008).
doi: 10.1073/pnas.0710273105; pmid: 18621693
42. M. R. Chandler, Some observations on infection of Arachis
hypogaea L. by Rhizobium. J. Exp. Bot. 29, 749-755 (1978).
doi: 10.1093/jxb/29.3.749
43. Z. Peng et al., Transcriptome profiles reveal gene regulation of
peanut (Arachis hypogaea L.) nodulation. Sci. Rep. 7, 40066
(2017). pmid: 28059169
44. D. W. Gorbet, J. C. Burton, A non-nodulating peanut.
Crop Sci. 19, 727-728 (1979). doi: 10.2135/
cropsci1979.0011183X001900050045x
45. S. De Mita, A. Streng, T. Bisseling, R. Geurts, Evolution
of a symbiotic receptor through gene duplications in the
legume-rhizobium mutualism. New Phytol. 201, 961-972
(2014). doi: 10.1111/nph.12549; pmid: 24400903
46. M. Parniske, Arbuscular mycorrhiza: The mother of plant
root endosymbioses. Nat. Rev. Microbiol. 6, 763-775 (2008).
doi: 10.1038/nrmicro1987; pmid: 18794914
47. P. Favre et al., A novel bioinformatics pipeline to discover
genes related to arbuscular mycorrhizal symbiosis based on
their evolutionary conservation pattern among higher plants.
BMC Plant Biol. 14, 333 (2014). doi: 10.1186/s12870-014-0333-0;
pmid: 25465219
48. A. Bravo, T. York, N. Pumplin, L. A. Mueller, M. J. Harrison,
Genes conserved for arbuscular mycorrhizal symbiosis
identified through phylogenomics. Nat. Plants 2, 15208 (2016).
doi: 10.1038/nplants.2015.208; pmid: 27249190
49. Z. Yang, PAML 4: Phylogenetic analysis by maximum
likelihood. Mol. Biol. Evol. 24, 1586-1591 (2007). doi: 10.1093/
molbev/msm088; pmid: 17483113
50. S. B. Carroll, Evo-devo and an expanding evolutionary
synthesis: A genetic theory of morphological evolution.
Cell 134, 25-36 (2008). doi: 10.1016/j.cell.2008.06.030;
pmid: 18614008

51. R. van Velzen et al., Comparative genomics of the nonlegume
Parasponia reveals insights into evolution of nitrogen-fixing
rhizobium symbioses. Proc. Natl. Acad. Sci. U.S.A. 115,
E4700-E4709 (2018). pmid: 29717040
52. L. G. Nagy, Evolution: Complex multicellular life with 5,500
genes. Curr. Biol. 27, R609-R612 (2017). doi: 10.1016/
j.cub.2017.04.032; pmid: 28633034
53. M. A. O'Malley, J. G. Wideman, I. Ruiz-Trillo, Losing
complexity: The role of simplification in macroevolution.
Trends Ecol. Evol. 31, 608-621 (2016). doi: 10.1016/
j.tree.2016.04.004; pmid: 27212432
54. D. S. LeBauer, K. K. Treseder, Nitrogen limitation of net
primary productivity in terrestrial ecosystems is globally
distributed. Ecology 89, 371-379 (2008). doi: 10.1890/
06-2057.1; pmid: 18409427
55. J. Streeter, P. P. Wong, Inhibition of legume nodule formation
and N2-fixation by nitrate. Crit. Rev. Plant Sci. 7, 1-23 (1988).
doi: 10.1080/07352688809382257
56. E. T. Kiers, R. A. Rousseau, S. A. West, R. F. Denison, Host
sanctions and the legume-rhizobium mutualism. Nature 425,
78-81 (2003). doi: 10.1038/nature01931; pmid: 12955144
57. H. Fujita, S. Aoki, M. Kawaguchi, Evolutionary dynamics
of nitrogen fixation in the legume-rhizobia symbiosis.
PLOS ONE 9, e93670 (2014). doi: 10.1371/
journal.pone.0093670; pmid: 24691447
58. L. Carro, P. Pujic, M. E. Trujillo, P. Normand, Micromonospora
is a normal occupant of actinorhizal nodules. J. Biosci. 38, 685-693
(2013). doi: 10.1007/s12038-013-9359-y; pmid: 24287646
59. J. I. Sprent, Legume Nodulation: A Global Perspective
(Wiley, 2009).
60. R. Zgadzaj et al., Root nodule symbiosis in Lotus japonicus
drives the establishment of distinctive rhizosphere, root, and
nodule bacterial communities. Proc. Natl. Acad. Sci. U.S.A. 113,
E7996-E8005 (2016). doi: 10.1073/pnas.1616564113;
pmid: 27864511
61. G. Marçais, C. Kingsford, A fast, lock-free approach for efficient
parallel counting of occurrences of k-mers. Bioinformatics 27,
764-770 (2011). doi: 10.1093/bioinformatics/btr011;
pmid: 21217122
62. E. Veeckman, T. Ruttink, K. Vandepoele, Are we there yet?
Reliably estimating the completeness of plant genome
sequences. Plant Cell 28, 1759-1768 (2016). doi: 10.1105/
tpc.16.00349; pmid: 27512012
63. R. Arratia, D. Martin, G. Reinert, M. S. Waterman, Poisson
process approximation for sequence repeats, and sequencing
by hybridization. J. Comput. Biol. 3, 425-463 (1996).
doi: 10.1089/cmb.1996.3.425; pmid: 8891959
64. R. Luo et al., SOAPdenovo2: An empirically improved memory-
efficient short-read de novo assembler. Gigascience 1, 18
(2012). doi: 10.1186/2047-217X-1-18; pmid: 23587118
65. R. Kajitani et al., Efficient de novo assembly of highly
heterozygous genomes from whole-genome shotgun short
reads. Genome Res. 24, 1384-1395 (2014). doi: 10.1101/
gr.170720.113; pmid: 24755901
66. E. W. Myers et al., A whole-genome assembly of Drosophila.
Science 287, 2196-2204 (2000). doi: 10.1126/
science.287.5461.2196; pmid: 10731133
67. M. Boetzer, C. V. Henkel, H. J. Jansen, D. Butler, W. Pirovano,
Scaffolding pre-assembled contigs using SSPACE.
Bioinformatics 27, 578-579 (2011). doi: 10.1093/
bioinformatics/btq683; pmid: 21149342
68. F. A. Simão, R. M. Waterhouse, P. Ioannidis, E. V. Kriventseva,
E. M. Zdobnov, BUSCO: Assessing genome assembly and
annotation completeness with single-copy orthologs.
Bioinformatics 31, 3210-3212 (2015). doi: 10.1093/
bioinformatics/btv351; pmid: 26059717
69. A. Smit, R. Hubley, P. Green, RepeatMasker Open-4.0,
2013-2015; www.repeatmasker.org.
70. Y. Han, S. R. Wessler, MITE-Hunter: A program for discovering
miniature inverted-repeat transposable elements from
genomic sequences. Nucleic Acids Res. 38, e199 (2010).
doi: 10.1093/nar/gkq862; pmid: 20880995
71. D. Ellinghaus, S. Kurtz, U. Willhoeft, LTRharvest, an efficient
and flexible software for de novo detection of LTR 
retrotransposons. BMC Bioinformatics 9, 18 (2008). 

doi: 10.1186/1471-2105-9-18; pmid: 18194517 
G. Gremme, S. Steinbiss, S. Kurtz, GenomeTools: 

A comprehensive software library for efficient processing of 
structured genome annotations. IEEE/ACM Trans. Comput. 
Biol. Bioinform. 10, 645-656 (2013). doi: 10.1109/ 
TCBB.2013.68; pmid: 24091398 


13 July 2018 


73. R. C. Edgar, MUSCLE: Multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 32, 1792-1797 (2004). doi: 10.1093/nar/gkh340; pmid: 15034147

74. M. S. Campbell et al., MAKER-P: A tool kit for the rapid creation, management, and quality control of plant genome annotations. Plant Physiol. 164, 513-524 (2014). doi: 10.1104/pp.113.230144; pmid: 24306534

75. B. J. Haas et al., Improving the Arabidopsis genome annotation using maximal transcript alignment assemblies. Nucleic Acids Res. 31, 5654-5666 (2003). doi: 10.1093/nar/gkg770; pmid: 14500829

76. M. Stanke, R. Steinkamp, S. Waack, B. Morgenstern, AUGUSTUS: A web server for gene finding in eukaryotes. Nucleic Acids Res. 32, W309-W312 (2004). doi: 10.1093/nar/gkh379; pmid: 15215400

77. A. V. Lukashin, M. Borodovsky, GeneMark.hmm: New solutions for gene finding. Nucleic Acids Res. 26, 1107-1115 (1998). doi: 10.1093/nar/26.4.1107; pmid: 9461475

78. I. Korf, Gene finding in novel genomes. BMC Bioinformatics 5, 59 (2004). doi: 10.1186/1471-2105-5-59; pmid: 15144565

79. M. G. Grabherr et al., Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat. Biotechnol. 29, 644-652 (2011). doi: 10.1038/nbt.1883; pmid: 21572440

80. E. M. Zdobnov, R. Apweiler, InterProScan—an integration platform for the signature-recognition methods in InterPro. Bioinformatics 17, 847-848 (2001). doi: 10.1093/bioinformatics/17.9.847; pmid: 11590104

81. W. Li, A. Godzik, Cd-hit: A fast program for clustering and comparing large sets of protein or nucleotide sequences. Bioinformatics 22, 1658-1659 (2006). doi: 10.1093/bioinformatics/btl158; pmid: 16731699

82. Y. Yang, S. A. Smith, Orthology inference in nonmodel organisms using transcriptomes and low-coverage genomes: Improving accuracy and matrix occupancy for phylogenomics. Mol. Biol. Evol. 31, 3081-3092 (2014). doi: 10.1093/molbev/msu245; pmid: 25158799

83. V. Lefort, R. Desper, O. Gascuel, FastME 2.0: A comprehensive, accurate, and fast distance-based phylogeny inference program. Mol. Biol. Evol. 32, 2798-2800 (2015). doi: 10.1093/molbev/msv150; pmid: 26130081

84. J. Huerta-Cepas, F. Serra, P. Bork, ETE 3: Reconstruction, analysis, and visualization of phylogenomic data. Mol. Biol. Evol. 33, 1635-1638 (2016). doi: 10.1093/molbev/msw046; pmid: 26921390

85. K. Katoh, D. M. Standley, MAFFT multiple sequence alignment software version 7: Improvements in performance and usability. Mol. Biol. Evol. 30, 772-780 (2013). doi: 10.1093/molbev/mst010; pmid: 23329690

86. G. Tan et al., Current methods for automated filtering of multiple sequence alignments frequently worsen single-gene phylogenetic inference. Syst. Biol. 64, 778-791 (2015). doi: 10.1093/sysbio/syv033; pmid: 26031838

87. A. Stamatakis, RAxML version 8: A tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics 30, 1312-1313 (2014). doi: 10.1093/bioinformatics/btu033; pmid: 24451623

88. V. Solovyev, in Handbook of Statistical Genetics, D. J. Balding, M. Bishop, C. Cannings, Eds. (Wiley, 2008), pp. 97-159.

89. C. D. Bell, D. E. Soltis, P. S. Soltis, The age and diversification of the angiosperms re-revisited. Am. J. Bot. 97, 1296-1303 (2010). doi: 10.3732/ajb.0900346; pmid: 21616882

90. Z. Xi et al., Phylogenomics and a posteriori data partitioning resolve the Cretaceous angiosperm radiation Malpighiales. Proc. Natl. Acad. Sci. U.S.A. 109, 17519-17524 (2012). doi: 10.1073/pnas.1205818109; pmid: 23045684

91. M. T. J. Johnson et al., Evaluating methods for isolating total RNA and predicting the success of sequencing phylogenetically diverse plant transcriptomes. PLOS ONE 7, e50226 (2012). doi: 10.1371/journal.pone.0050226; pmid: 23185583

92. M. V. Han, G. W. C. Thomas, J. Lugo-Martinez, M. W. Hahn, Estimating gene gain and loss rates in the presence of error in genome assembly and annotation using CAFE 3. Mol. Biol. Evol. 30, 1987-1997 (2013). doi: 10.1093/molbev/mst100; pmid: 23709260

93. L.-T. Nguyen, H. A. Schmidt, A. von Haeseler, B. Q. Minh, IQ-TREE: A fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol. Biol. Evol. 32, 268-274 (2015). doi: 10.1093/molbev/msu300; pmid: 25371430

94. D. T. Hoang et al., MPBoot: Fast phylogenetic maximum parsimony tree inference and bootstrap approximation. BMC Evol. Biol. 18, 11 (2018). doi: 10.1186/s12862-018-1131-3; pmid: 29390973




RESEARCH | RESEARCH ARTICLE 


95. S. Kalyaanamoorthy, B. Q. Minh, T. K. F. Wong, A. von Haeseler, 
L. S. Jermiin, ModelFinder: Fast model selection for accurate 
phylogenetic estimates. Nat. Methods 14, 587-589 (2017). 
doi: 10.1038/nmeth.4285; pmid: 28481363 

96. M. Griesmann, Y. Chang, X. Liu, Y. Song, G. Haberer,
M. B. Crook, B. Billault-Penneteau, D. Lauressergues,
L. Imanishi, Y. P. Roswanjaya, W. Kohlen, P. Pujic,
K. Battenberg, N. Alloisio, W. Sun, H. Hilhorst, M. G. Salgado,
V. Hocher, H. Gherbi, S. Svistoonoff, J. J. Doyle, S. He, Y. Xu,
W. Xian, Y. Fu, P. Normand, A. M. Berry, L. G. Wall, J. M. Ané,
K. Pawlowski, X. Xu, H. Yang, M. Spannagl, K. F. X. Mayer,
G. K. Wong, M. Parniske, P. M. Delaux, S. Cheng, Data
supporting multiple independent losses of nitrogen-fixing root
nodule symbiosis. GigaDB (2018); doi: 10.5524/100300


ACKNOWLEDGMENTS 


The authors thank J. Sprent and G. Werner for initial discussions; 
R. Hagen and S. Singer for C. fasciculata leaves; S. Cannon for 

C. canadensis leaves; N. Séjalon-Delmas (Toulouse Botanical 
Garden), the International Center for Tropical Agriculture (CIAT), 
and USDA ARS-GRIN for seeds; and D. Lang for bioinformatics 
advice. Funding: This work was partly performed in the LRSV, 
which belongs to the TULIP LABEX (ANR-10-LABX-41), and was 
supported by grants from the Shenzhen Municipal Government 
of China to S.C. (JCYJ20150529150409546) and Y.C. 
(JCYJ20150831201643396); a grant from the Swedish Research 
Council Vetenskapsrådet (VR 2012-03061) to K.P.; the German
Ministry of Education and Research grant 031A536 “de.NBI” 

and the Deutsche Forschungsgemeinschaft Project SFB924 to 
K.F.X.M. and M.P.; grants from the National Science Foundation 
(NSF 1237936 and 1546742) to J.-M.A.; grants from the French 
National Research Agency (ANR-12-BSV7-0007-02), the USDA 
(NIFA 2015-67014-22849), and ECOS-SUD (A13B03) to V.H., H.G., 
and S.S.; a grant from the National University of Quilmes, 
Argentina (PUNQ 1411/15) to L.G.W.; and by USDA grant NIFA590 
CA-D-PLS-2173-H to A.M.B. W.K. is supported by NWO-VENI 
(863.15.010), and L.G.W. and L.I. are members of CONICET
(Argentina). G.K.-S.W. is funded by Alberta Innovates Technology 


Griesmann et al., Science 361, eaat1743 (2018) 


Futures (AITF) Innovates Centres of Research Excellence (iCORE). 
M.P. acknowledges funding by ERC Advanced grant 340904 
(EvolvingNodules), which supported the work of M.G. and B.B.-P. 
and the sabbatical stay of K.P. at the LMU Munich. Author 
contributions: K.P., M.P., P.-M.D., and J.-M.A. defined the species 
to sequence and designed the project. K.P. coordinated the work 
on actinorhizal species, supervised M.G.S., isolated DNA for Datisca 
glomerata and Begonia fuchsioides, and extracted RNA of
B. fuchsioides. M.G.S. prepared RNA of D. glomerata. P.-M.D.
coordinated the consortium, provided DNA and RNA samples 
for N. schottii, conducted phylogenetic analyses of target genes, 
and supervised D.L. and J.K. D.L. extracted DNA for and 
conducted the experiment presented in fig. S5. J.K. conducted 
the PAML analysis on NIN and RPG. J.-M.A. supervised 
experiments on most legumes. M.B.C. extracted DNA and RNA of 
C. fasciculata, M. pudica, and C. canadensis. W.K. and Y.P.R. 
extracted DNA samples for C. australe, Sophora flavescens, and the 
Chamaecrista species used in fig. S5. L.G.W. collected D. trinervis 
plant material and supervised L.I. L.I. prepared DNA and RNA
samples for D. trinervis. P.N. supervised experiments on 

Alnus glutinosa. N.A. and P.P. collected A. glutinosa material, 
designed extraction protocols, and extracted RNA (N.A.) and 

DNA (P.P.). H.G., V.H., and S.S. collected C. glauca material, 
supervised DNA extraction, and extracted RNA for C. glauca. K.F.X.M.
and M.S. coordinated genome assembly and annotation.

M.S. and G.H. supervised M.G. G.H. performed genome assembly, 
gene annotation, phylogenetic analyses, and prepared Fig. 1. 

M.P. initiated the consortium and supervised M.G. and B.B.-P. and 
the work on Dryas drummondii. M.G. conducted most of the 
phylogenetic analyses and prepared Figs. 2 and 3 and related 
supplementary texts and files. B.B.-P. collected material and 
extracted DNA and RNA from D. drummondii. S.C. and G.K.-S.W. 
designed the sequencing strategy and coordinated and supervised 
the bioinformatics and genomic analysis, including genome 
sequencing, assembly, and annotation at BGI-Shenzhen with H.Y., 
X.L., Q.G., X.F., and X.X. X.L. performed genome analyses.
Y.C. performed sample management and data quality control
and developed custom bioinformatics pipelines for genome assembly,
annotation, and evaluation with S.X. S.X. managed the
supercomputer clusters to perform assembly and annotation. Y.S. identified
orthologs for Fig. 3 and performed most of the microsynteny 
analyses. Y.L. ran RNA data quality control, assembly, alignment, and 
annotation. S.H. finalized the synteny analysis and prepared Fig. 3. 
Y.X., J.Q., and Y.F. performed genome assembly, repeat, and gene 
annotation, respectively, for M. pudica, A. glutinosa, and C. canadensis 
(Y.X.); N. schottii and D. trinervis (J.G.); and C. glauca and 

C. fasciculata (Y.F.). H.H. provided access to the unpublished 
genome of C. australe. J.J.D. provided comments and suggestions 
on the possible evolutionary scenarios and contributed to writing 
the manuscript. A.M.B. and K.B. provided material and comments on 
the experimental strategy. S.C. wrote the supplementary texts with 
contributed sections from all co-authors. M.P. and P.-M.D. wrote the 
manuscript with inputs and comments from co-authors. Competing 
interests: The authors declare no competing interests. Data and 
materials availability: All raw reads have been deposited in the 
Sequence Read Archive (SRA) under project PRJNA374770; accession 
codes are summarized in the supplementary materials. All files have 
been deposited in GigaDB (96). All material used for genome 
sequencing is available on request to the responsible author as 
indicated in data S2. All materials used for genome sequencing are 
available on request to P.N. for A. glutinosa; K.P. for D. glomerata and 
B. fuchsioides; S.S., H.G., or V.H. for C. glauca; J.-M.A. or M.B.C. for
C. canadensis, C. fasciculata, and M. pudica; L.G.W. for D. trinervis; M.P.
for D. drummondii; and P.-M.D. for N. schottii (also listed in data S2).


SUPPLEMENTARY MATERIALS 
www.sciencemag.org/content/361/6398/eaat1743/suppl/DC1
Materials and Methods
Figs. S1 to S27

Tables S1 to S37 
References (97-103) 
Data S1 and S2 


12 February 2018; accepted 16 May 2018 
Published online 24 May 2018 
10.1126/science.aat1743




RESEARCH 


RESEARCH ARTICLE SUMMARY 


STRUCTURAL DYNAMICS 


Retinal isomerization in 
bacteriorhodopsin captured by 
a femtosecond x-ray laser 


Przemyslaw Nogly, Tobias Weinert, Daniel James, Sergio Carbajo, Dmitry Ozerov, 
Antonia Furrer, Dardan Gashi, Veniamin Borin, Petr Skopintsev, Kathrin Jaeger, 
INTRODUCTION: Retinal is a light-sensitive protein ligand that is used by all domains of life to process the information and energy content of light. Retinal-binding proteins are integral membrane proteins that drive vital biological processes, including light sensing for spatial orientation and circadian clock adjustment, as well as maintaining electrochemical gradients through ion transport. They also form the basis for optogenetic manipulation of neural cells. How the protein environment guides retinal isomerization on a subpicosecond time scale toward a single high-yield product is a fundamental outstanding question in photobiology.


RATIONALE: Light-induced isomerization of retinal is among the fastest reactions known in biology. It has been widely studied by spectroscopic techniques to probe the evolution of spectral intermediates over time. Using x-ray free-electron lasers (XFELs), it is now possible to observe ultrafast photochemical reactions and their induced molecular motions within proteins on scales of femtoseconds to milliseconds with near-atomic structural resolution. In this work, we used XFEL radiation to study the structural dynamics of retinal isomerization in the light-driven proton-pump bacteriorhodopsin (bR). The principal mechanism of isomerization in this prototypical retinal-binding protein has direct relevance for all other members of this important family of membrane proteins, and it provides insight into how protein environments catalyze photochemical reactions in general.

[Figure panels: all-trans retinal to 13-cis retinal; ground state, 49-406 fs, 457-646 fs, 10 ps.]

Time-resolved serial crystallography resolves ultrafast atomic motions of retinal and the surrounding protein following photoexcitation. Retinal evolves from an all-trans conformation in the ground state toward a twisted 13-cis retinal over the course of a few hundred femtoseconds. The complex counterion, formed by two aspartic acid residues (Asp) and a water molecule (Wat), responds to changes in the electronic structure of the chromophore on the same time scale as the formation of the excited state.


RESULTS: We collected high-resolution x-ray diffraction data from bR microcrystals injected across the femtosecond x-ray pulses of the Linac Coherent Light Source after excitation of the retinal chromophore by an optical laser pulse. X-ray diffraction images were sorted into temporal subgroups with a precision of about 200 fs. A series of 18 overlapping difference Fourier electron density maps reveal structural changes over the first picosecond of retinal photoexcitation. Complementary data for time delays of 10 ps and 8.33 ms allow us to resolve the later stages of the reaction. In combination with refined crystallographic structures at pump-probe delays corresponding to where the spectroscopically characterized I, J, K, and M intermediates form in solution, our time-resolved structural data reveal the trajectory of retinal isomerization and provide atomic details at key points along the reaction.

The aspartic acid residues of the retinal counterion and functional water molecules in close proximity to the retinal Schiff base respond collectively to the formation and decay of the excited state. This collective motion sets the stage for retinal isomerization, which proceeds via a twisted retinal configuration. Quantum mechanics/molecular mechanics simulations provide theoretical support for this structural evolution.

Read the full article at http://dx.doi.org/10.1126/science.aat0094


CONCLUSION: Our observations reveal how, concomitant with the formation of the earliest excited state, the retinal-binding pocket opens up in close proximity to the isomerizing bond. We propose that ultrafast charge transfer along retinal is a driving force for collective motions that contribute to the stereoselectivity and efficiency of retinal isomerization within a protein scaffold. Vibrational quake-like motions extending from retinal to the protein may also be a mechanism through which excess energy is released in a nonradiative fashion.


The list of author affiliations is available in the full article online. 
*Corresponding author. Email: joerg.standfuss@psi.ch 

Cite this article as P. Nogly et al., Science 361, eaat0094
Gebhard Schertler, Richard Neutze, Jörg Standfuss*


Ultrafast isomerization of retinal is the primary step in photoresponsive biological functions including vision in humans and ion transport across bacterial membranes. We used an x-ray laser to study the subpicosecond structural dynamics of retinal isomerization in the light-driven proton pump bacteriorhodopsin. A series of structural snapshots with near-atomic spatial resolution and temporal resolution in the femtosecond regime show how the excited all-trans retinal samples conformational states within the protein binding pocket before passing through a twisted geometry and emerging in the 13-cis conformation. Our findings suggest ultrafast collective motions of aspartic acid residues and functional water molecules in the proximity of the retinal Schiff base as a key facet of this stereoselective and efficient photochemical reaction.


Organisms harvest light for its energy and information content. Seven-transmembrane-helix retinal proteins use the absorption of photons to achieve both of these purposes across all domains of life. Family members include visual rhodopsins (light receptors triggering vision) in animals, as well as bacteriorhodopsin and proteorhodopsins (proton pumps), halorhodopsins (anion pumps), channelrhodopsins (gated ion channels), and sensory rhodopsin (phototaxis receptors) in archaea and bacteria. Retinal-binding ion pumps and channels have found exciting applications in the optogenetic manipulation of neural cells (1).
Activation of seven-transmembrane-helix retinal proteins is initiated by the photochemical trans-to-cis (or cis-to-trans for visual rhodopsins) isomerization of the conjugated double bond system of the retinal chromophore. Photo-isomerization is one of the fastest reactions in biology, being completed within a few picoseconds after photon absorption. Retinal isomerization occurs stereoselectively at a specific double bond within the binding pocket of the protein host and with a quantum yield of up to 67% (2, 3). In contrast, illumination of all-trans retinal in solution leads to formation of a mixture of different stereoisomers (13-cis, 11-cis, and 9-cis) with a quantum yield of only a few percent for each subproduct (4). How the protein scaffold rapidly guides retinal isomerization toward a single high-yield product is an outstanding and fundamental question in photobiology.

Bacteriorhodopsin (bR) is an archetypical proton pump that has long served as a system for understanding how the energy of a captured photon may be used to achieve unidirectional proton transport against a transmembrane proton concentration gradient. Chemical insights into retinal isomerization have emerged from ultrafast spectroscopy, which identified the I, J, and K intermediates rising and decaying in the ultrafast time regime (Fig. 1) (5-8). Hybrid quantum mechanics/molecular mechanics (QM/MM) simulations have provided further insight into the initial steps of retinal isomerization, including predictions of energy and charge redistributions (9). Despite these advances, theoretical and spectroscopic studies require complementary structural information to build a complete picture of the highly efficient photo-isomerization of retinal within proteins. Researchers have described reaction-cycle intermediates from x-ray structures after illuminating crystals with visible light at low temperatures or changing their pH (10). Such intermediate-trapping studies, however, have yielded conflicting results for the earliest available structural intermediate, K (11-13). Importantly, they cannot address the first events upon photo-isomerization because the low thermal energy barriers are not rate-limiting on the ultrafast time scale.

With the advent of x-ray free-electron lasers (XFELs), time-resolved serial femtosecond crystallography (TR-SFX) has emerged as a powerful method to study ultrafast structural changes in proteins (14). TR-SFX was validated against time-resolved Laue diffraction recorded at a synchrotron radiation source, using photoactive yellow protein (PYP) as a model system (15). The method has further provided insight into ultrafast structural changes during the photodissociation of carbon monoxide from the active site of myoglobin (16), isomerization of p-coumaric acid in PYP (17), and isomerization of hydroxybenzylidene imidazolinone in rsEGFP2 (reversibly photoswitchable green fluorescent protein 2) (18). TR-SFX has also tracked the evolution of protein and water molecule rearrangements in bR from nanoseconds to milliseconds, from which a coherent picture of proton pumping has emerged (19). We used this powerful new methodology to characterize the photo-isomerization of retinal and the immediate adaptations of its protein binding pocket.

X-ray diffraction data were collected to 1.50-Å resolution (table S1) from a continuous stream of light-adapted bR microcrystals grown in a lipidic cubic phase (LCP) (20, 21) at the coherent x-ray imaging (CXI) beamline (22) of the Linac Coherent Light Source (LCLS) (23). About 1 million indexable diffraction patterns were collected in a pump-probe scheme using a sequence of four x-ray probe pulses between every optical pump laser pulse (fig. S1) (24). The x-ray diffraction images immediately following the optical laser pulse at four nominal delays were sorted into subgroups based on the LCLS timing tool signal (25) with a temporal resolution estimated to be on the order of 200 fs (16, 17) (fig. S2). Sequential windows of data within the first picosecond [overlapping pump-probe delay (Δt) ranges with ~30,000 sorted diffraction patterns], together with data collected at Δt = 10 ps and Δt = 8.33 ms, yielded 20 snapshots of bR activation.
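
The sorting step described above can be sketched in a few lines. This is only an illustration under stated assumptions, not the authors' pipeline: the function name `overlapping_delay_windows`, its parameters `per_window` and `stride`, and the idea of using fixed-count sliding windows over delay-sorted shots are all hypothetical choices made here to show how overlapping Δt ranges with a roughly constant number of patterns could be formed from per-shot delays.

```python
import numpy as np


def overlapping_delay_windows(delays_fs, per_window, stride):
    """Group shots into overlapping windows of equal size, ordered by delay.

    delays_fs  : per-shot pump-probe delay in femtoseconds (hypothetical input,
                 e.g. nominal delay corrected by a timing-tool reading)
    per_window : number of shots in each window (assumed constant)
    stride     : shift, in shots, between consecutive windows; a stride smaller
                 than per_window makes neighbouring windows overlap
    Returns a list of (min_delay, max_delay, shot_indices) tuples.
    """
    delays_fs = np.asarray(delays_fs, dtype=float)
    order = np.argsort(delays_fs, kind="stable")  # shot indices sorted by delay
    windows = []
    for start in range(0, len(order) - per_window + 1, stride):
        idx = order[start:start + per_window]
        windows.append((delays_fs[idx[0]], delays_fs[idx[-1]], idx))
    return windows


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Synthetic delays spread over the first picosecond (illustrative only).
    delays = rng.uniform(0.0, 1000.0, size=2000)
    for lo, hi, idx in overlapping_delay_windows(delays, per_window=500, stride=250):
        print(f"{lo:7.1f} to {hi:7.1f} fs: {len(idx)} shots")
```

Fixed-count windows keep the number of patterns per snapshot constant, so the delay range covered by each window varies with how densely the delays are sampled.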


Trajectory of retinal isomerization 


A long-distance overview of electron density changes ($F_{\mathrm{obs}}^{\mathrm{light}} - F_{\mathrm{obs}}^{\mathrm{dark}}$) in bR during the course of the first 10 ps is shown in movie S1. Changes are already visible for the earliest time point and are initially clustered at the retinal and its binding pocket. A close-up view of the retinal chromophore (Fig. 2 and movie S2) reveals negative density features that are visible above
Fig. 1. Early events in bacteriorhodopsin (bR) activation. (A) Structural changes in the seven-transmembrane-helix proton-pump bR are initiated by the trans-to-cis isomerization of the covalently bound retinal chromophore. The arrows indicate the direction of H+ transport from the cytoplasmic side (IC) to the extracellular side (EC) of the membrane. A hydrogen-bonding network including the complex counterion network and several water molecules is critical for maintaining the high affinity of the Schiff base (SB) link for protons. (B) Serial femtosecond crystallography allows us to resolve the retinal molecule and its binding pocket with near-atomic resolution ($2F_{\mathrm{obs}} - F_{\mathrm{calc}}$ electron density of the light-adapted resting state at 1.5-Å resolution shown in gray at 2σ) at ambient temperature and in the membrane-like environment of lipidic cubic phases. The atoms C13 and C14 are labeled to indicate the bond undergoing isomerization. (C) The retinal chromophore (sticks with van der Waals radii shown as translucent halos) is tightly encased within the binding pocket of the bR dark state (yellow). (D) Retinal isomerization is an ultrafast event progressing within picoseconds from the bR dark state through the I and J intermediates to the K intermediate. I forms within about 200 fs after excitation and is characterized by a rapid rise in the retinal dipole moment (5) and twisting of the polyene backbone (6). After about 500 fs, the J intermediate evolves, which initiates the isomerization reaction (7) and turns into the K state with isomerized 13-cis retinal after about 3 ps (8). The ultrafast dynamic interplay between retinal and its binding pocket increases quantum efficiency and guides stereoselectivity of retinal isomerization.


Fig. 2. Femtosecond structural dynamics along the retinal chromophore. (A) The potential energy diagram (energy, y axis; reaction coordinates, x axis) indicates how absorption of a photon promotes all-trans retinal to the Franck-Condon point (FC). The excitation energy is kinetically dissipated after passage through the conical intersection (CI). Trans-cis isomerization occurs along local energy minima correlating with the I, J, and K spectroscopic intermediates. According to the commonly accepted model, the transition of the excited-state intermediate I to the first ground-state product J occurs within about 0.5 ps and coincides with the onset of retinal isomerization (7, 27, 59). (B to E) The progression of difference Fourier electron density ($F_{\mathrm{obs}}^{\mathrm{light}} - F_{\mathrm{obs}}^{\mathrm{dark}}$, contoured at 4σ; gold, negative; blue, positive) displayed along a dark-state model allows us to follow the trans-cis transition and see how the double bond system is sampled immediately after photoactivation, before selective isomerization around the C13=C14 bond occurs. The process results in an energetically elevated 13-cis isomer that is still twisted by the protein binding pocket 10 ps after activation. All temporal snapshots of this trajectory are available as movie S2.


4σ (σ is the root mean square deviation of electron density) on the carbon atoms of the retinal polyene backbone (associated with C8, C10, C12, and C14) and corresponding weaker positive features (2.7σ to 4.4σ; table S3) for Δt < 500 fs, but which decay for later time delays. Theoretical analyses have argued that photon absorption induces a partial charge relocation in retinal from the Schiff base (SB) toward the β-ionone ring in the excited state and before the trans-to-cis isomerization (9). Measurements of transient changes in the excitonic coupling between the retinal dipole moment and nearby tryptophan residues imply that this charge redistribution occurs within 200 fs after photon absorption (5).
Simultaneously, nuclear rearrangements lead to an extension of double bonds and contraction of the single bonds (so-called inversion of the bond length alternation), which allows the planar geometry of retinal to distort in the excited state (26). Such distortion is consistent with the negative and (weaker) positive features of the difference Fourier electron density map ($F_{\mathrm{obs}}^{\mathrm{light}} - F_{\mathrm{obs}}^{\mathrm{dark}}$) observed around the C7 to C14 atoms of the retinal on this time scale (Fig. 2).

Movie 1. Retinal isomerization in refined structures with extrapolated electron density ($2F_{\mathrm{ext}} - F_{\mathrm{calc}}$, contoured at 1.2σ).

Movie 2. Dynamic view of retinal isomerization and response of the counterion network. The movie shows the transitions between four structural intermediates: ground state, Δt = 49 to 406 fs, Δt = 457 to 646 fs, and Δt = 10 ps. The morphs between the refined structures were generated with Chimera (60) and reveal a fast motion proceeding through the entire structure, followed by a slower rearrangement. A zoom-in to the region of the counterion network highlights the hydrogen-bond breakage between the SB and Wat402 (W402). The detailed view of the retinal isomerization visualizes the structural rearrangements as the isomerization proceeds. Time scales in the movie do not correlate to real time scales, and the structures are linear morphs of atom motions. The 457 to 646 fs → 10 ps transition is shown more slowly than the other two transitions to highlight the longer time scale of the transition.

Nogly et al., Science 361, eaat0094 (2018)
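
To make the σ-level language used for the difference density concrete, the following is a minimal sketch, assuming the difference map is already available as a 3D grid. It is a simplified real-space stand-in: the Fourier synthesis that produces the map from observed amplitudes and model phases is not shown, and the function names `sigma_scale` and `features_beyond` are hypothetical. The sketch rescales a difference grid by its root mean square deviation and flags voxels beyond a chosen contour level such as 4σ.

```python
import numpy as np


def sigma_scale(diff_map):
    """Express a difference density grid in units of its own r.m.s. deviation."""
    diff_map = np.asarray(diff_map, dtype=float)
    centered = diff_map - diff_map.mean()
    sigma = np.sqrt(np.mean(centered ** 2))  # root mean square deviation
    if sigma == 0.0:
        raise ValueError("difference map is constant; sigma is undefined")
    return centered / sigma


def features_beyond(diff_sigma, level=4.0):
    """Return boolean masks of negative and positive features beyond +/- level."""
    negative = diff_sigma <= -level
    positive = diff_sigma >= level
    return negative, positive


if __name__ == "__main__":
    rng = np.random.default_rng(1)
    # Synthetic "light" and "dark" grids: noise plus one depleted and one
    # enriched voxel (illustrative values only, not data from the study).
    dark = rng.normal(0.0, 1.0, size=(24, 24, 24))
    light = dark + rng.normal(0.0, 0.05, size=dark.shape)
    light[5, 5, 5] -= 1.0
    light[6, 5, 5] += 0.6
    scaled = sigma_scale(light - dark)
    neg, pos = features_beyond(scaled, level=4.0)
    print("voxels below -4 sigma:", int(neg.sum()))
    print("voxels above +4 sigma:", int(pos.sum()))
```

Working in σ units makes peak heights comparable between maps computed from different numbers of diffraction patterns, which is why contour levels are quoted this way.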

The retinal initially samples possible isomerization geometries for Δt < 500 fs, but the possible evolution is heavily constrained by the rigidity of the binding pocket. After this sampling of double bond geometries in the initial excited state, electron density changes focus around the C13=C14 double bond, suggesting a coordinated evolution throughout the trans-cis isomerization. The progress of retinal isomerization can be followed in extrapolated electron density maps (fig. S3 and Movie 1). Changes over time are best visualized by overlaying the difference Fourier electron density map with the resting and refined structures for the time ranges Δt = 49 to 406 fs, 457 to 646 fs, 10 ps, and 8.33 ms (table S2) with Fig. 3. These time windows correlate approximately with accumulation of the spectroscopically distinct I, J, K, and M intermediates identified in solution (5-8, 27, 28). In the earliest temporal windows, paired difference density features indicate that the retinal SB nitrogen is already displaced toward helix G, which is quantified by structural refinement as a 0.5-Å sideways motion of the SB but with the retinal geometry otherwise remaining similar to the all-trans conformation. For Δt = 457 to 646 fs, a stronger continuous negative difference is apparent along the C20-C13=C14 bonds, and two positive difference density peaks flank a negative feature on the SB nitrogen. This time point is modeled as a twisted retinal configuration with the C12-C13=C14-C15 torsion angle ~90° and the C15=Nζ bond lying in the plane of the membrane. Overall changes between the dark and Δt = 457 to 646 fs time delay twist the torsion angles of the C13=C14 and the neighboring C11=C12 and C15=Nζ bonds in opposite directions, indicating that the reaction proceeds according to the aborted bicycle-pedal model of retinal isomerization (9, 29, 30).

By Δt = 10 ps, only one positive feature remains in the difference Fourier
electron density map that is complementary to the negative peak on the SB
nitrogen atom, and paired positive and negative density features are
associated with the retinal C20 methyl group. Structural refinement
indicates that these changes represent the retinal having reached its
twisted 13-cis configuration and the C20 methyl being twisted toward helix
G. This sequence of events is entirely consistent with QM/MM simulations of
the retinal excited state when the key water molecule (Wat402), Lys216, and
the counterion network are included




Fig. 3. Structural intermediates at pump-probe delays (Δt) corresponding to
times when key spectral intermediates of the bR photocycle occur.
(A) Δt = 49 to 406 fs, I intermediate. (B) Δt = 457 to 646 fs,
J intermediate. (C) Δt = 10 ps, K intermediate. (D) Δt = 8.33 ms,
M intermediate. The structure of the bR dark state is shown in magenta and
overlaid with the bR reaction intermediates in red. The difference Fourier
electron density map ($F_{\mathrm{obs}}^{\mathrm{light}} - F_{\mathrm{obs}}^{\mathrm{dark}}$,
contoured at 4σ; gold, negative; blue, positive) and arrows indicate the
structural rearrangements around the isomerizing C13=C14 bond and the
retinal counterion. The corresponding extrapolated electron density maps
($2F_{\mathrm{ext}} - F_{\mathrm{calc}}$) are available as fig. S3 and Movie 1.

Fig. 4. Close-up view of the protonated SB and associated counterion
network. (A to D) Progression of the difference Fourier electron density map
($F_{\mathrm{obs}}^{\mathrm{light}} - F_{\mathrm{obs}}^{\mathrm{dark}}$,
contoured at 4σ; gold, negative; blue, positive) shows the coherent motions
of the hydrogen-bonding network connecting the SB with Asp85, Asp212, and
Tyr57 via water molecules Wat402, Wat401, and Wat406. Arrows indicate
correlated motions in retinal and the hydrogen-bonding network. The sticks
represent the dark-state model of bR. All temporal snapshots are available
as movie S3.


within the quantum chemical description [fig. S4 and (9)], providing
theoretical support for the structural evolution that we observed. Lastly,
the difference Fourier electron density map recorded for Δt = 8.33 ms
displays all major features previously observed for Δt = 1.725 ms (19),
including the planar 13-cis retinal characteristic of later reaction
intermediates of the bR photocycle.


Ultrafast response of the 
counterion cluster 


A key feature in the resting bR state is a hydrogen bond (H-bond)
interaction of the SB with Wat402, which participates in the H-bonding
network with the counterions Asp85 and Asp212 and stabilizes the positive
charge on the SB nitrogen (31). Mutation of either Asp85 or Asp212 to
noncharged residues greatly reduces the rate of retinal isomerization (32),
and both have been implicated in guiding stereoselectivity of the reaction
(33). Wat402 and the SB respond to the photon absorption of retinal even
before the isomerization appears to begin, moving up to 0.6 Å further away
from each other (to a distance of 3.25 Å) after light absorption (Δt = 49 to
406 fs). This separation increases to 4.05 Å once the retinal is isomerized
(Δt = 10 ps). This movement is visualized as very strong, paired positive
and negative features in the difference Fourier electron density map (from
6σ to 11σ; Fig. 4) associated with Wat402, Asp85, and Tyr57, which together
reveal an ultrafast collective motion of a group of polar residues connected
through H-bond interactions (Movie 2). The driving force for this motion is
the ultrafast redistribution of the positive charge away from the SB and
toward the β-ionone ring.


Quantum chemical calculation of the difference between the ground- and
excited-state electron density (fig. S4) shows the localization of changes
along the retinal polyene chain. The sudden polarization instantaneously
pushes the water molecule away from the SB (up to a distance of 3.65 Å at
Δt = 457 to 646 fs) and breaks the SB-Wat402 H-bond (which would otherwise
oppose the SB reorientation during isomerization). Loss of the H-bond lowers
the energy barrier for a specific isomerization pathway (34), allowing the
retinal to isomerize about the C13=C14 bond. These collective motions are
later damped, with the retinal being fully isomerized at Δt = 10 ps, while
Wat402 loses its interaction with the SB, consistent with a changing
H-bonding pattern in this region predicted by low-temperature Fourier
transform infrared (35) and Raman spectroscopy (36). The rigidity of the
large hydrophobic region of the retinal-binding pocket, which prevents
unwanted C7=C8, C9=C10, or C11=C12 cis products, is in stark contrast with
the specific ultrafast motions associated with Wat402 and the counterion
cluster. These findings implicate ultrafast collective motions as an
essential ingredient for efficient isomerization within the protein
scaffold.


Protein quake 


The collective motions from the counterion region extend up to 12 Å away
from the SB nitrogen within 600 fs (fig. S5) yet become dispersed on longer
time scales. This dampening is evidenced by weaker paired difference
densities associated with Tyr57 at Δt = 759 to 1025 fs, as well as the
motions of Thr89 and Asp85, which are no longer visible at Δt = 10 ps.
Changes are further apparent when comparing the refined structural
intermediates at Δt = 49 to 406 fs, 457 to 646 fs, and 10 ps (Movie 2). As
such, these motions propagate away from the active site at 2 nm/ps, which is
noticeably faster than the speed of sound in water (~1.5 nm/ps). These
observations are consistent with the propagation of motions in the bacterial
photoreaction center (37) and myoglobin (16, 38, 39) and the theory of
protein quakes that suggests how excess energy in proteins can be dissipated
in earthquake-like motions of collective structural deformations (40). In
the case of photoactive proteins such as bR, these motions may be the
mechanism through which excess energy from decay of the excited state is
released in a nonradiative fashion.
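
The propagation speed quoted above follows from the two figures given in the paragraph (12 Å reached within 600 fs). The short sketch below only redoes that unit conversion; the function and constant names are illustrative and not taken from the original analysis.

```python
# Worked check of the propagation speed quoted in the text.
SPEED_OF_SOUND_WATER_NM_PER_PS = 1.5  # approximate value quoted in the text


def propagation_speed_nm_per_ps(distance_angstrom: float, time_fs: float) -> float:
    """Convert a distance in angstroms and a time in femtoseconds to nm/ps."""
    distance_nm = distance_angstrom / 10.0  # 10 angstroms = 1 nm
    time_ps = time_fs / 1000.0              # 1000 fs = 1 ps
    return distance_nm / time_ps


speed = propagation_speed_nm_per_ps(distance_angstrom=12.0, time_fs=600.0)
ratio = speed / SPEED_OF_SOUND_WATER_NM_PER_PS
print(f"{speed:.1f} nm/ps, {ratio:.2f}x the quoted speed of sound in water")
```

Because 12 Å is described as an upper extent ("up to"), the resulting 2 nm/ps is best read as an approximate figure.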

Our ultrafast TR-SFX data recorded using an XFEL capture a fundamental
photochemical reaction and one of the fastest processes in biology. In
particular, our measurements uncover the initial structural rearrangements
in the excited state that guide the selective isomerization process.
Photo-excited retinal initially samples multiple isomerization pathways
within its binding pocket, but the ultimate fate of the reaction is steered
by the near-instantaneous charge redistribution along the retinal and the
resulting collective motions of the SB counterion complex. Breaking of the
SB-Wat402 H-bond enables the trans-to-cis isomerization about the C13=C14
double bond. Similar ultrafast collective motions may guide the
stereochemistry and enhance the quantum efficiency of retinal isomerization
in such diverse processes as phototrophy in the seas, neuron stimulation in
optogenetics, and the initiation of visual signals within the photoreceptors
of our eyes.


Materials and methods 
Purification and crystallization of bR 


Bacteriorhodopsin was purified from purple membranes of Halobacterium
salinarum. The purification and crystallization were performed as described
previously (24, 42) with a few modifications. The purple membranes were
solubilized overnight in the presence of 1.7% β-octylglucoside (Anatrace)
and 50 mM sodium phosphate buffer pH 6.9 (GERBU). After solubilization, the
pH was adjusted to 5.5 with 0.1 M HCl and the insoluble fraction was removed
by 1-hour centrifugation at 150,000 × g. The size exclusion purification
step was omitted following (9). The protein was concentrated to 40 to
80 mg ml⁻¹ using Millipore centricolumns with 50 kDa cutoff. Lipidic cubic
phase (LCP) for crystallization was obtained in a Hamilton syringe by mixing
protein with monoolein (Nu-Chek) in a 42:58 ratio. Subsequently, the LCP was
slowly injected into another Hamilton syringe filled with crystallization
buffer consisting of 100 mM Sorensen buffer pH 5.6 (GERBU) and 30%
polyethylene glycol 2000 (Molecular Dimensions). The crystallization was
carried out at 21°C, and crystals between 15 µm and 50 µm along the longest
edge and with a thickness of 1 to 5 µm were obtained within 3 to 6 days
(average crystal size 35 × 35 × 3 µm³). All purification and crystallization
steps were performed under dim red light or in the dark.


Sample preparation for LCP-SFX 


The crystallization buffer was removed from LCP 
containing crystals via a syringe coupler by slow 
pressing on the syringe plunger. Shortly before 
the experiment monoolein was added to bring 
the sample into a homogeneous and transparent 
LCP mesophase. The sample was supplemented 
with 5% of MAG 7.9 to prevent phase transition 
during injection into vacuum and 5% of paraffin 
was added to support smooth flow of the sample. 
The crystal density within the sample was homogenized using a custom-made
“3-way syringe coupler”. It consisted of three syringe inlets, and the
simultaneous passage of sample from two syringes into a third one over
several cycles ensured optimal crystal distribution throughout the sample
volume for smooth jet operation. The stability of jetting was tested offline
with a high-speed camera, providing further evidence that the addition of
paraffin improved stable jetting.
Selected batches of the prepared crystalline sample 
were tested in advance to confirm high hit rates 
and diffraction quality using the Swiss Light Source 
set up for LCP injection (42). Before data collec- 
tion the sample was light adapted at 300 mW for 
5 min through a long-pass yellow filter (>515 nm). 
Sample was used for data collection no longer 
than 30 min after light exposure to prevent decay 
of the light adapted dark state. On average, 1 ml 
of final sample was used per 12-hour shift. 


Data collection

The TR-SFX experiment was performed in June 2017 in the microfocus chamber
of the CXI beamline at LCLS (43) with the integrated optical pump-probe
laser setup (44). The 50-fs XFEL pulses at 9.5 keV were delivered at 120 Hz
into a vacuum chamber. In the experiment we used a modified version of the
LCP injector (20) with a larger sample reservoir of 130 µl to allow longer
data collection and the higher flow rates (2.5 µl min⁻¹ through a 50 µm
diameter nozzle and, for a small amount of data, 5 µl min⁻¹ through a 75 µm
diameter nozzle) necessary for time-resolved crystallography (24). The LCP
stream was aligned with the x-ray beam and a Ti:sapphire-pumped optical
parametric amplifier producing optical laser pulses with a 30 Hz repetition
rate, 100 fs pulse length, 529 nm wavelength, and 17 µJ energy in a focal
spot of 95 µm (1/e). The arrival time of the optical laser pulse relative to
the XFEL pulse was monitored using the timing tool at LCLS (25, 45) with
nominal time delays Δt = -500 fs, 300 fs, 600 fs, 900 fs, 1100 fs, and
10 ps. As the exact timing is subject to jitter, data collected at these
delays were sorted and binned after collection based on their timing tool
signal (see Data sorting and binning). By combining all indexed patterns
from the second XFEL pulse after the pump laser excitation (175 µm from the
interaction region) we could further resolve Δt = 8.33 ms, but with lower
occupancy due to this time point being excited further from the pump laser
peak intensity (fig. S1). We obtained a high quality dark dataset (table S1)
by merging patterns of the fourth XFEL pulse after light excitation from all
datasets. For clarity, the 120 Hz XFEL, 30 Hz optical data collection scheme
is further illustrated in fig. S1.
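
The interleaving of the 120 Hz x-ray pulses with the 30 Hz pump laser can be laid out with a few lines of arithmetic. The sketch below is only an illustration of the scheme described above: the role labels paraphrase the text, the third pulse is marked as not described because this section does not say how it was used, and all names are hypothetical.

```python
XFEL_RATE_HZ = 120   # x-ray pulse repetition rate given in the text
LASER_RATE_HZ = 30   # optical pump repetition rate given in the text

pulses_per_pump = XFEL_RATE_HZ // LASER_RATE_HZ   # x-ray pulses per laser shot
pulse_spacing_ms = 1000.0 / XFEL_RATE_HZ          # time between x-ray pulses

# Roles paraphrased from the text; pulse 3 is not discussed in this section.
ROLES = {
    1: "pump-probe delays (fs to ps range)",
    2: "8.33 ms time point",
    4: "dark reference",
}


def pulse_table():
    """Return (pulse index, extra delay in ms after the first pulse, role)."""
    rows = []
    for k in range(1, pulses_per_pump + 1):
        extra_delay_ms = (k - 1) * pulse_spacing_ms
        rows.append((k, extra_delay_ms, ROLES.get(k, "use not described here")))
    return rows


for index, delay_ms, role in pulse_table():
    print(f"pulse {index}: +{delay_ms:.2f} ms, {role}")
```

The second row reproduces the 8.33 ms delay (1/120 s) quoted for the second XFEL pulse after excitation.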

An important parameter for a successful TR-SFX experiment is the laser power
density at the sample position, which is critical for efficient activation
but difficult to estimate. In our case we aligned the laser approximately
50 µm below the x-ray interaction region to exclude light contamination of
the following pulses. This shift reduced the nominal laser power density of
2.4 TW/cm² to 0.324 TW/cm² [factor of 0.135 (1/e²)] at the x-ray interaction
region. This value is in a similar range as in previously published
ultrafast TR-SFX studies (16, 17, 46). Using an offline setup with a
continuous laser we established that scattering from the 50 µm wide LCP jet
(without protein or crystals) reduces the incident laser light by about 80%.
Scattering from the LCP jet and the high optical density of the crystals
further reduced the average light intensity any given bR molecule was
exposed to. Taking the typical crystal dimensions of 35 × 35 × 3 µm³ and the
parameters above into account, we estimate 3.3 to 0.6 absorbed photons per
retinal depending on the orientation of the crystal with respect to the
laser pulse. This exposure is within the range where multiphoton effects are
negligible in a crystallographic analysis based on the small spectroscopic
effect under similar laser conditions (46-48). Moreover, our setup provided
comparable results to our previous experiment using a nanosecond pump laser
with much lower power density (0.06 GW cm⁻²) at SACLA (19), with a twisted
retinal at 10 ps and 16 ns (fig. S7). The two delays correspond to the lower
and upper end of the temporal range where the K intermediate of bR is
observed spectroscopically, and the high similarity between the two
structures indicates that productive photocycles can be induced using a
femtosecond laser pump pulse. The similarity of structures for 10 ps/16 ns
provides us with an internal control for the presented experiments in the
femtosecond range.
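
The power densities quoted above can be reproduced to within rounding from the stated pulse parameters. The sketch below assumes, as a simplification that is not stated in the text, that the nominal value is the peak power (pulse energy divided by pulse length) spread uniformly over a circle whose diameter is the 95 µm focal spot; it then applies the 1/e² factor given in the text.

```python
import math

PULSE_ENERGY_J = 17e-6       # 17 µJ, from the text
PULSE_LENGTH_S = 100e-15     # 100 fs, from the text
SPOT_DIAMETER_CM = 95e-4     # 95 µm focal spot, from the text


def nominal_power_density_tw_cm2(energy_j: float, duration_s: float,
                                 diameter_cm: float) -> float:
    """Peak power over a uniform circular spot, in TW/cm^2 (assumed model)."""
    peak_power_w = energy_j / duration_s
    area_cm2 = math.pi * (diameter_cm / 2.0) ** 2
    return peak_power_w / area_cm2 / 1e12


nominal = nominal_power_density_tw_cm2(PULSE_ENERGY_J, PULSE_LENGTH_S,
                                       SPOT_DIAMETER_CM)
attenuation = math.exp(-2)           # the 1/e^2 factor (~0.135) from the text
at_interaction_region = nominal * attenuation

print(f"nominal: {nominal:.2f} TW/cm^2")
print(f"at x-ray interaction region: {at_interaction_region:.3f} TW/cm^2")
```

Under this assumption the nominal value comes out close to the quoted 2.4 TW/cm², and the attenuated value close to the quoted 0.324 TW/cm².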

Our choice of laser energy was guided by two previous TR-SFX experiments
with very similar femtosecond pump laser parameters but different laser
energies (fig. S9). In December 2015, we used 8 µJ and 12 µJ, with the
former yielding nearly flat Fourier difference electron density maps while
the latter provided very weak signal with a few peaks at 3.5σ and estimated
activation levels of about 5%. In July 2014, we used 20 µJ, which yielded
about 13% activation levels (24). In the currently reported experiment we
aimed for activation levels better than 10% and measured all time delays
with a consistent 17 µJ in order to collect interpretable crystallographic
data.


Data sorting and binning 


Diffraction images were selected with Cheetah (49) followed by further
sorting according to event codes corresponding to one of the four images
collected after the optical laser pulse. Light data in the femtosecond range
were sorted by pump-probe delay time using the timing tool available at CXI
with an yttrium aluminum garnet (YAG) target (45). The spectrally encoded
delay was determined by the standard matched-filter procedure implemented in
psana, the LCLS-provided analysis software (50). Strong etalon effects
caused by the thick YAG target necessitated one modification to this
standard analysis. Before edge-position determination by matched filtering,
the spectral signal was bandstop filtered using a 5th-order Butterworth
filter with cut-on and cut-off frequencies of 45 fs⁻¹ and 120 fs⁻¹,
respectively, effectively removing signal oscillations due to the etalon.
The data were initially sorted into 20 bins with 10,000 images each (see
fig. S1). For calculating structural snapshots, we combined 3 overlapping
bins, which led to 18 snapshots in the femtosecond range, which we further
complemented with data collected at Δt = 10 ps and 8.33 ms (see table S1).
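
The count of 18 snapshots is what a sliding window of three adjacent bins yields over 20 bins (20 - 3 + 1). The sketch below assumes that "overlapping" means the window advances by one bin at a time; the text does not state the step explicitly, and the names are illustrative.

```python
N_BINS = 20              # initial timing-sorted bins, from the text
IMAGES_PER_BIN = 10_000  # images per bin, from the text
WINDOW = 3               # bins combined per structural snapshot, from the text


def overlapping_windows(n_bins: int, width: int) -> list:
    """Return tuples of bin indices for a window sliding one bin at a time."""
    return [tuple(range(start, start + width))
            for start in range(n_bins - width + 1)]


snapshots = overlapping_windows(N_BINS, WINDOW)
images_per_snapshot = WINDOW * IMAGES_PER_BIN

print(len(snapshots), "snapshots of", images_per_snapshot, "images each")
print("first:", snapshots[0], "last:", snapshots[-1])
```

Each snapshot then contains about 30,000 images before indexing, in line with the dataset sizes described under Data analysis.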


Data processing 


The images were indexed using the mosflm and dirax indexing algorithms on
the peaks identified by Cheetah. Data were integrated (the integration
radius was 3 pixels for the peak region and 4 to 7 pixels for the background
annulus, integrating 1.2 nm⁻¹ beyond the conservative resolution limit
determined on a per-image basis) and merged with CrystFEL (51) version
0.6.3. A geometry refinement using geoptimiser was carried out on a subset
of images that diffracted beyond 2.0 Å in CrystFEL’s conservative apparent
resolution. This step improved high-resolution statistics after reindexing
and reintegration of the data. The indexing ambiguity in space group P6₃ was
resolved using ambigator in CrystFEL (52). Data were scaled and merged using
partialator in CrystFEL without the calculation of partialities
(--model=unity), using one cycle of scaling and applying the option to
extend the conservative resolution limit determined on a per-image basis
(“--push-res 1.2”). A small subset of patterns (11,758) with HPLC back
pressures 25 bars above the average value for each data run was discarded in
order to eliminate any source of potential light contamination in dark data
due to occasional jet slowdowns owing to higher crystal density.


Data analysis 


For the structure determination of the dark state, every fourth diffraction
image after the optical laser pulse was used. The signal recorded by the
timing tool for every first diffraction image after the optical laser pulse
was used to sort the diffraction images after the experiment into bins with
defined time ranges (see Data sorting and binning). Each of the datasets
(table S1) consists of three overlapping original bins (fig. S1). The time
ranges used for data analysis were initially chosen to contain around 30,000
images (about 27,000 after indexing, see table S1) to ensure good quality
data for analysis and to be comparable to the number of images collected at
10 ps. The resolution cutoff for the light data was established so that most
datasets have at least a CC* of 0.5 at 1.50 Å. We used a dark dataset of
29,461 patterns collected without the optical pump laser to cross-check our
high quality dark dataset for light contamination and could not observe
difference density peaks corresponding to the M-state above the noise level
of 2.5σ.


Structure determination and refinement 
of the bR dark state 


The PDB entry with accession code 5J7A (24), with all water molecules and
lipids removed, was used as the molecular replacement model in Phaser (53).
The resting state structure was obtained with several cycles of building and
refinement using COOT (54) and PHENIX (55) to a resolution of 1.5 Å
(table S2).


Calculation of difference Fourier
density maps


The difference Fourier density maps were calculated with
phenix.fobs_minus_fobs_map (55) including the multiscaling feature and a low
resolution cutoff at 4.5 Å. The electron density peak minima and maxima in
σ units (= root mean square electron density of the unit cell) were
extracted in COOT (56). Table S3 lists peak values for regions of the
density map which at any point in the ultrafast time range exceeded ±4σ.
The values below 2.5σ were set to 0 as being of too low significance.
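
The two thresholds described above (list a region only if it exceeds ±4σ at some time point, and zero any entry below 2.5σ) can be written as a small filter. This is a sketch of that rule only: the region names and peak values below are invented for illustration and are not data from table S3.

```python
NOISE_FLOOR_SIGMA = 2.5        # entries weaker than this are set to 0
LISTING_THRESHOLD_SIGMA = 4.0  # a region is listed if it ever exceeds this


def tabulate_peaks(peaks_by_region: dict) -> dict:
    """Keep regions that exceed the listing threshold; zero weak entries."""
    table = {}
    for region, values in peaks_by_region.items():
        if any(abs(v) > LISTING_THRESHOLD_SIGMA for v in values):
            table[region] = [v if abs(v) >= NOISE_FLOOR_SIGMA else 0.0
                             for v in values]
    return table


# Hypothetical peak heights (in sigma) at three time points per region.
example = {
    "region A": [1.8, -4.6, 3.1],
    "region B": [2.9, -3.2, 2.0],
}
result = tabulate_peaks(example)
print(result)
```

In this made-up input only "region A" is retained, and its first entry is replaced by 0 because it lies below the 2.5σ floor.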


Refinement of intermediate states 


In a TR-SFX experiment, crystals subjected to an optical laser light pulse
contain a mixture of light-activated molecules and dark state molecules. We
used extrapolated structure factors $F_{\mathrm{ext}}$ (57) to enable
modeling of bacteriorhodopsin activated states, an approach similar to that
used to model the subpicosecond structural states of PYP (17). The
extrapolated structure factors were calculated using a linear approximation
(58) as follows:

$$F_{\mathrm{ext}} = \frac{F_{\mathrm{obs}}^{\mathrm{light}} - F_{\mathrm{obs}}^{\mathrm{dark}}}{\text{activated fraction}} + F_{\mathrm{obs}}^{\mathrm{dark}}$$

The $2F_{\mathrm{ext}} - F_{\mathrm{calc}}$ maps calculated with phases of
the dark state model showed distinct features in agreement with the
$F_{\mathrm{obs}}^{\mathrm{light}} - F_{\mathrm{obs}}^{\mathrm{dark}}$
Fourier difference maps. Using this comparison, the light-activated fraction
of molecules in the data time ranges 49 to 406 fs, 457 to 646 fs, 10 ps, and
8.33 ms was estimated to be 25%, 18%, 16%, and 10%, respectively,
corresponding to fractions where the features associated exclusively with
the dark state disappear from extrapolated maps. Reciprocal space refinement
against the $F_{\mathrm{ext}}$ to 1.9 Å resolution in PHENIX (55) in
combination with real space fitting in COOT (56) provided models (table S2)
for the respective time points, with all of them showing about 98% of
residues in the Ramachandran favored region. A comparison of calculated
($F_{\mathrm{calc}}^{\mathrm{light}} - F_{\mathrm{calc}}^{\mathrm{dark}}$)
and experimental
($F_{\mathrm{obs}}^{\mathrm{light}} - F_{\mathrm{obs}}^{\mathrm{dark}}$)
difference densities (fig. S8) was used as an additional control to validate
the refined structures. Comparison of the bR 8.33 ms structure obtained from
the refinement against the extrapolated structure factors with the 1.725 ms
structure (19) obtained by an alternative conformation refinement approach
shows a high level of similarity between the structures derived using the
two different approaches (fig. S6).
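
The linear extrapolation described above amounts to scaling the light-minus-dark difference by the inverse of the activated fraction and adding it back to the dark amplitude. The sketch below applies that formula to plain lists of amplitudes; the function name and the amplitude values are hypothetical, and a real workflow would operate on reflection files rather than Python lists.

```python
def extrapolate_structure_factors(f_light, f_dark, activated_fraction):
    """Apply F_ext = (F_light - F_dark) / fraction + F_dark per reflection."""
    if not 0.0 < activated_fraction <= 1.0:
        raise ValueError("activated_fraction must lie in (0, 1]")
    if len(f_light) != len(f_dark):
        raise ValueError("light and dark lists must have the same length")
    return [(fl - fd) / activated_fraction + fd
            for fl, fd in zip(f_light, f_dark)]


# Hypothetical amplitudes for two reflections, with 25% activation.
f_dark = [100.0, 50.0]
f_light = [102.0, 49.0]
f_ext = extrapolate_structure_factors(f_light, f_dark, activated_fraction=0.25)
print(f_ext)  # [108.0, 46.0]
```

With a fraction of 1.0 the function returns the light amplitudes unchanged, which is a quick sanity check on the formula.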




QM/MM calculations 


We used a hybrid quantum mechanics/molecular mechanics (QM/MM) method to
study the photochemical reaction of bR. The key step in setting up such a
QM/MM calculation is the partitioning of the total system into QM and MM
subsystems. The QM subsystem is described using a quantum chemical method,
while the MM subsystem is computed by a classical force field. Following the
crystallographic results, we included the retinal chromophore and the Lys216
sidechain, as well as the counterion sidechains Arg82, Asp85, and Asp212 and
the water molecules Wat402, Wat401, and Wat406, in the QM part, while the
rest of the protein was treated at the MM level. The MM subsystem is treated
using an AMBER force field and interacts with the QM subsystem through
electrostatic embedding. In all the simulations we performed, the protein
backbone was kept fixed in the crystallographic position, and only the QM
section and all sidechains that have at least one atom within 5 Å of the
retinal Schiff base and Lys216 moiety were allowed to move freely in the
geometry optimization.

To optimize the geometry in the ground state, the QM subsystem was described
using the pure-GGA BP86 functional with Grimme’s corrections for dispersion
interactions and the cc-pVDZ basis set. The difference between the ground
and excited state density was computed using CAM-B3LYP and the cc-pVTZ basis
set.


Multimessenger observations of a
flaring blazar coincident with
high-energy neutrino IceCube-170922A

The IceCube Collaboration, Fermi-LAT, MAGIC, AGILE, ASAS-SN, HAWC, H.E.S.S.,
INTEGRAL, Kanata, Kiso, Kapteyn, Liverpool Telescope, Subaru, Swift/NuSTAR,
VERITAS, and VLA/17B-403 teams*†


INTRODUCTION: Neutrinos are tracers of cosmic-ray acceleration: electrically
neutral and traveling at nearly the speed of light, they can escape the
densest environments and may be traced back to their source of origin.
High-energy neutrinos are expected to be produced in blazars: intense
extragalactic radio, optical, x-ray, and, in some cases, γ-ray sources
characterized by relativistic jets of plasma pointing close to our line of
sight. Blazars are among the most powerful objects in the Universe and are
widely speculated to be sources of high-energy cosmic rays. These cosmic
rays generate high-energy neutrinos and γ-rays, which are produced when the
cosmic rays accelerated in the jet interact with nearby gas or photons. On
22 September 2017, the cubic-kilometer IceCube Neutrino Observatory detected
a ~290-TeV neutrino from a direction consistent with the flaring γ-ray
blazar TXS 0506+056. We report the details of this observation and the
results of a multiwavelength follow-up campaign.


RATIONALE: Multimessenger astronomy aims for globally coordinated
observations of cosmic rays, neutrinos, gravitational waves, and
electromagnetic radiation across a broad range of wavelengths. The
combination is expected to yield crucial information on the mechanisms
energizing the most powerful astrophysical sources. That the production of
neutrinos is accompanied by electromagnetic radiation from the source favors
the chances of a multiwavelength identification. In particular, a measured
association of high-energy neutrinos with a flaring source of γ-rays would
elucidate the mechanisms and conditions for acceleration of the
highest-energy cosmic rays. The discovery of an extraterrestrial diffuse
flux of high-energy neutrinos, announced by IceCube in 2013, has
characteristic properties that hint at contributions from extragalactic
sources, although the individual sources remain as yet unidentified.
Continuously monitoring the entire sky for astrophysical neutrinos, IceCube
provides real-time triggers for observatories around the world measuring
γ-rays, x-rays, optical, radio, and gravitational waves, allowing for the
potential identification of even rapidly fading sources.


RESULTS: A high-energy neutrino-induced muon track was detected on
22 September 2017, automatically generating an alert that was distributed
worldwide within 1 min of detection and prompted follow-up searches by
telescopes over a broad range of wavelengths. On 28 September 2017, the
Fermi Large Area Telescope Collaboration reported that the direction of the
neutrino was coincident with a cataloged γ-ray source, 0.1° from the
neutrino direction. The source, a blazar known as TXS 0506+056 at a measured
redshift of 0.34, was in a flaring state at the time with enhanced γ-ray
activity in the GeV range. Follow-up observations by imaging atmospheric
Cherenkov telescopes, notably the Major Atmospheric Gamma Imaging Cherenkov
(MAGIC) telescopes, revealed periods where the detected γ-ray flux from the
blazar reached energies up to 400 GeV. Measurements of the source have also
been completed at x-ray, optical, and radio wavelengths. We have
investigated models associating neutrino and γ-ray production and find that
correlation of the neutrino with the flare of TXS 0506+056 is statistically
significant at the level of 3 standard deviations (sigma). On the basis of
the redshift of TXS 0506+056, we derive constraints for the muon-neutrino
luminosity for this source and find them to be similar to the luminosity
observed in γ-rays.


CONCLUSION: The energies of the γ-rays and the neutrino indicate that blazar
jets may accelerate cosmic rays to at least several PeV. The observed
association of a high-energy neutrino with a blazar during a period of
enhanced γ-ray emission suggests that blazars may indeed be one of the
long-sought sources of very-high-energy cosmic rays, and hence responsible
for a sizable fraction of the cosmic neutrino flux observed by IceCube.


Multimessenger observations of blazar TXS 0506+056. The 50% and 90%
containment regions for the neutrino IceCube-170922A (dashed red and solid
gray contours, respectively), overlain on a V-band optical image of the sky.
Gamma-ray sources in this region previously detected with the Fermi
spacecraft are shown as blue circles, with sizes representing their 95%
positional uncertainty and labeled with the source names. The IceCube
neutrino is coincident with the blazar TXS 0506+056, whose optical position
is shown by the pink square. The yellow circle shows the 95% positional
uncertainty of very-high-energy γ-rays detected by the MAGIC telescopes
during the follow-up campaign. The inset shows a magnified view of the
region around TXS 0506+056 on an R-band optical image of the sky.


Read the full article at http://dx.doi.org/10.1126/science.aat1378


The list of author affiliations is available in the full 
article online. 

*The full lists of participating members for each 
team and their affiliations are provided in the 
supplementary materials. 

†Email: analysis@icecube.wisc.edu

Cite this article as IceCube Collaboration et al., 
Science 361, eaat1378 (2018). DOI: 10.1126/ 
science.aat1378 


Multimessenger observations of a
flaring blazar coincident with 
high-energy neutrino IceCube-170922A 


The IceCube Collaboration, Fermi-LAT, MAGIC, AGILE, ASAS-SN, HAWC, H.E.S.S., 
INTEGRAL, Kanata, Kiso, Kapteyn, Liverpool Telescope, Subaru, Swift/NuSTAR, 


VERITAS, and VLA/17B-403 teams* 


Previous detections of individual astrophysical sources of neutrinos are
limited to the Sun and the supernova 1987A, whereas the origins of the
diffuse flux of high-energy cosmic neutrinos remain unidentified. On
22 September 2017, we detected a high-energy neutrino, IceCube-170922A, with
an energy of ~290 tera-electron volts. Its arrival direction was consistent
with the location of a known γ-ray blazar, TXS 0506+056, observed to be in a
flaring state. An extensive multiwavelength campaign followed, ranging from
radio frequencies to γ-rays. These observations characterize the variability
and energetics of the blazar and include the detection of TXS 0506+056 in
very-high-energy γ-rays. This observation of a neutrino in spatial
coincidence with a γ-ray-emitting blazar during an active phase suggests
that blazars may be a source of high-energy neutrinos.


Since the discovery of a diffuse flux of high-energy astrophysical neutrinos
(1, 2), IceCube has searched for its sources. The only nonterrestrial
neutrino sources identified previously are the Sun and the supernova 1987A,
producing neutrinos with energies millions of times lower than the
high-energy diffuse flux, such that the mechanisms and the environments
responsible for the high-energy cosmic neutrinos are still to be ascertained
(3, 4). Many candidate source types exist, with active galactic nuclei (AGN)
among the most prominent (5), in particular the small fraction of them
designated as radio-loud (6). In these AGNs, the central supermassive black
hole converts gravitational energy of accreting matter and/or the rotational
energy of the black hole into powerful relativistic jets, within which
particles can be accelerated to high energies. If a number of these
particles are protons or nuclei, their interactions with the radiation
fields and matter close to the source would give rise to a flux of
high-energy pions that eventually decay into photons and neutrinos (7). In
blazars (8), AGNs that have one of the jets pointing close to our line of
sight, the observable flux of neutrinos and radiation is expected to be
greatly enhanced owing to relativistic Doppler boosting. Blazar
electromagnetic (EM) emission is known to be highly variable on time scales
from minutes to years (9).
Neutrinos travel largely unhindered by matter
and radiation. Even if high-energy photons (TeV
and above) are unable to escape the source owing
to intrinsic absorption, or are absorbed by interactions
with the extragalactic background light
(EBL) (10, 11), high-energy neutrinos may escape
and travel unimpeded to Earth. An association
of observed astrophysical neutrinos with blazars
would therefore imply that high-energy protons
or nuclei up to energies of at least tens of PeV are
produced in blazar jets, suggesting that they may
be the birthplaces of the most energetic particles
observed in the Universe, the ultrahigh-energy
cosmic rays (12). If neutrinos are produced in
correlation with photons, the coincident observation
of neutrinos with electromagnetic flares
would greatly increase the chances of identifying
the source(s). Neutrino detections must therefore
be combined with the information from broadband
observations across the electromagnetic
spectrum (multimessenger observations).

*The full lists of participating members for each team and their
affiliations are provided in the supplementary materials.
†Email: analysis@icecube.wisc.edu

To take advantage of multimessenger oppor- 
tunities, the IceCube neutrino observatory (13) 
has established a system of real-time alerts that 
rapidly notify the astronomical community of the 
direction of astrophysical neutrino candidates 
(14). From the start of the program in April 2016 
through October 2017, 10 public alerts have been 
issued for high-energy neutrino candidate events 
with well-reconstructed directions (15). 

We report the detection of a high-energy neutrino
by IceCube and the multiwavelength/multi-instrument
observations of a flaring γ-ray blazar,
TXS 0506+056, which was found to be positionally
coincident with the neutrino direction (16).
Chance coincidence of the IceCube-170922A
event with the flare of TXS 0506+056 is statistically
disfavored at the level of 3σ in models
evaluated below, associating neutrino and γ-ray
production.


The neutrino alert 


IceCube is a neutrino observatory with more
than 5000 optical sensors embedded in 1 km³ of
the Antarctic ice sheet close to the Amundsen-Scott
South Pole Station. The detector consists of
86 vertical strings frozen into the ice 125 m apart, 
each equipped with 60 digital optical modules 
(DOMs) at depths between 1450 and 2450 m. 
When a high-energy muon-neutrino interacts
with an atomic nucleus in or close to the detector
array, a muon is produced that moves through
the ice faster than the speed of light in ice, creating
Cherenkov radiation detected by the DOMs. On
22 September 2017 at 20:54:30.43 Coordinated 
Universal Time (UTC), a high-energy neutrino- 
induced muon track event was detected in an 
automated analysis that is part of IceCube’s real-time
alert system. An automated alert was distributed
(17) to observers 43 s later, providing an
initial estimate of the direction and energy of the 
event. A sequence of refined reconstruction algo- 
rithms was automatically started at the same 
time, using the full event information. A repre- 
sentation of this neutrino event with the best- 
fitting reconstructed direction is shown in Fig. 1. 
Monitoring data from IceCube indicate that the 
observatory was functioning normally at the time 
of the event. 
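
The detector numbers quoted above can be checked with a few lines of arithmetic. The sketch below multiplies strings by modules per string and evaluates the Cherenkov emission angle from cos θ = 1/(nβ). The refractive index of ice used here (n ≈ 1.32) is an assumption for illustration and is not stated in the text; the function names are likewise hypothetical.

```python
import math

# Detector layout as stated in the text.
N_STRINGS = 86
DOMS_PER_STRING = 60

# Assumption (not from the text): approximate refractive index of deep ice.
N_ICE = 1.32


def total_doms(n_strings: int = N_STRINGS, per_string: int = DOMS_PER_STRING) -> int:
    """Number of digital optical modules in the array."""
    return n_strings * per_string


def cherenkov_angle_deg(beta: float = 1.0, n: float = N_ICE) -> float:
    """Cherenkov emission angle in degrees from cos(theta) = 1 / (n * beta)."""
    cos_theta = 1.0 / (n * beta)
    if cos_theta > 1.0:
        # The particle is slower than light in the medium: no Cherenkov light.
        raise ValueError("particle is below the Cherenkov threshold")
    return math.degrees(math.acos(cos_theta))


if __name__ == "__main__":
    print("DOM count:", total_doms())
    print("Cherenkov angle for beta = 1 [deg]:", round(cherenkov_angle_deg(), 1))
```

With 86 strings of 60 modules each, the product is consistent with the "more than 5000 optical sensors" quoted above.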

A Gamma-ray Coordinates Network (GCN)
Circular (18) was issued ~4 hours after the initial
notice, including the refined directional information
(offset 0.14° from the initial direction; see
Fig. 2). Subsequently, further studies were performed
to determine the uncertainty of the directional
reconstruction arising from statistical and
systematic effects, leading to a best-fitting right
ascension (RA) $77.43^{+0.95}_{-0.65}$ and declination (Dec)
$+5.72^{+0.50}_{-0.30}$ (degrees, J2000 equinox, 90% containment
region). The alert was later reported
to be in positional coincidence with the known
γ-ray blazar TXS 0506+056 (16), which is located
at RA 77.36° and Dec +5.69° (J2000) (19),
0.1° from the arrival direction of the high-energy
neutrino. 
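
The quoted 0.1° offset can be reproduced from the two sets of coordinates given above. The following is a minimal sketch that uses the haversine formula for the great-circle separation; the function name is hypothetical and only the central values of the best-fitting neutrino direction are used, without their uncertainties.

```python
import math


def angular_separation_deg(ra1: float, dec1: float, ra2: float, dec2: float) -> float:
    """Great-circle separation between two sky positions (all angles in degrees)."""
    ra1, dec1, ra2, dec2 = map(math.radians, (ra1, dec1, ra2, dec2))
    d_ra = ra2 - ra1
    d_dec = dec2 - dec1
    # Haversine formula: numerically stable for small separations.
    a = math.sin(d_dec / 2) ** 2 + math.cos(dec1) * math.cos(dec2) * math.sin(d_ra / 2) ** 2
    return math.degrees(2 * math.asin(math.sqrt(a)))


# Central values quoted in the text (J2000, degrees).
NEUTRINO_RA_DEC = (77.43, 5.72)
BLAZAR_RA_DEC = (77.36, 5.69)

separation = angular_separation_deg(*NEUTRINO_RA_DEC, *BLAZAR_RA_DEC)

if __name__ == "__main__":
    print(f"separation = {separation:.3f} deg")
```

The result is below 0.1° before rounding, well inside the 90% containment region.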

The IceCube alert prompted a follow-up search 
by the Mediterranean neutrino telescope ANTARES 
(Astronomy with a Neutrino Telescope and Abyss 
environmental RESearch) (20). The sensitivity of 
ANTARES at the declination of IceCube-170922A 
is about one-tenth that of IceCube’s (21), and no
neutrino candidates were found in a ±1 day period
around the event time (22). 

An energy of 23.7 ± 2.8 TeV was deposited in
IceCube by the traversing muon. To estimate the
parent neutrino energy, we performed simulations
of the response of the detector array, considering
that the muon-neutrino might have interacted
outside the detector at an unknown distance. We
assumed the best-fitting power-law energy spectrum
for astrophysical high-energy muon neutrinos,
$dN/dE \propto E^{-2.13}$ (2), where N is the number
of neutrinos as a function of energy E. The simulations
yielded a most probable neutrino energy
of 290 TeV, with a 90% confidence level (CL)
lower limit of 183 TeV, depending only weakly on
the assumed astrophysical energy spectrum (25).

The vast majority of neutrinos detected by
IceCube arise from cosmic-ray interactions within
Earth’s atmosphere. Although atmospheric neutrinos
are dominant at energies below 100 TeV,
their spectrum falls steeply with energy, allowing
astrophysical neutrinos to be more easily identified
at higher energies. The muon-neutrino astrophysical
spectrum, together with simulated
data, was used to calculate the probability that a
neutrino at the observed track energy and zenith
angle in IceCube is of astrophysical origin. This
probability, the so-called signalness of the event
(14), was reported to be 56.5% (17). Although
IceCube can robustly identify astrophysical neutrinos
at PeV energies, for individual neutrinos
at several hundred TeV, an atmospheric origin
cannot be excluded. Electromagnetic observations
are valuable to assess the possible association of
a single neutrino to an astrophysical source.

Following the alert, IceCube performed a
complete analysis of relevant data prior to
31 October 2017. Although no additional excess
of neutrinos was found from the direction of TXS
0506+056 near the time of the alert, there are
indications of neutrino emission from that direction
in data prior to 2017, as discussed in a companion
paper (26).

Fig. 1. Event display for neutrino event IceCube-170922A.
The time at which a DOM observed a signal is reflected in
the color of the hit, with dark blues for earliest hits and
yellow for latest. Times shown are relative to the first DOM
hit according to the track reconstruction, and earlier and
later times are shown with the same colors as the first and
last times, respectively. The total time the event took to
cross the detector is ~3000 ns. The size of a colored sphere
is proportional to the logarithm of the amount of light
observed at the DOM, with larger spheres corresponding to
larger signals.
[Panel label: side view.]

Fig. 2. Fermi-LAT and MAGIC observations of IceCube-170922A's
location. Sky position of IceCube-170922A in J2000 equatorial coordinates
overlaying the γ-ray counts from Fermi-LAT above 1 GeV (A) and the signal
significance as observed by MAGIC (B) in this region. The tan square
indicates the position reported in the initial alert, and the green square
indicates the final best-fitting position from follow-up reconstructions (18).
Gray and red curves show the 50% and 90% neutrino containment regions,
respectively, including statistical and systematic errors. Fermi-LAT data are
shown as a photon counts map in 9.5 years of data in units of counts per
pixel, using detected photons with energy of 1 to 300 GeV in a 2° by 2°
region around TXS 0506+056. The map has a pixel size of 0.02° and was
smoothed with a 0.02°-wide Gaussian kernel. MAGIC data are shown as
signal significance for γ-rays above 90 GeV. Also shown are the locations of
a γ-ray source observed by Fermi-LAT as given in the Fermi-LAT Third
Source Catalog (3FGL) (23) and the Third Catalog of Hard Fermi-LAT
Sources (3FHL) (24) source catalogs, including the identified positionally
coincident 3FGL object TXS 0506+056. For Fermi-LAT catalog objects,
marker sizes indicate the 95% CL positional uncertainty of the source.
[Axes: Right Ascension; color scale: MAGIC significance (σ). Labeled sources: TXS 0506+056, PKS 0502+049.]

13 July 2018


High-energy γ-ray observations of
TXS 0506+056


On 28 September 2017, the Fermi Large Area
Telescope (LAT) Collaboration reported that the
direction of origin of IceCube-170922A was consistent
with a known γ-ray source in a state of
enhanced emission (16). Fermi-LAT is a pair-conversion
telescope aboard the Fermi Gamma-ray
Space Telescope sensitive to γ-rays with energies
from 20 MeV to greater than 300 GeV (27). Since
August 2008, it has operated continuously, primarily
in an all-sky survey mode. Its wide field
of view of ~2.4 steradian provides coverage of the
entire γ-ray sky every 3 hours. The search for possible
counterparts to IceCube-170922A was part of
the Fermi-LAT collaboration’s routine multiwavelength,
multimessenger program.

Inside the error region of the neutrino event,
a positional coincidence was found with a previously
cataloged γ-ray source, 0.1° from the best-fitting
neutrino direction. TXS 0506+056 is a
blazar of BL Lacertae (BL Lac) type. Its redshift of
z = 0.3365 ± 0.0010 was measured only recently
based on the optical emission spectrum in a
study triggered by the observation of IceCube-170922A
(28).


TXS 0506+056 is a known Fermi-LAT γ-ray
source, appearing in three catalogs of Fermi
sources (23, 24, 29) at energies above 0.1, 50, and
10 GeV, respectively. An examination of the
Fermi All-Sky Variability Analysis (FAVA) (30)
photometric light curve for this object showed
that TXS 0506+056 had brightened considerably
in the GeV band starting in April 2017 (16).
Independently, a γ-ray flare was also found by
Fermi’s Automated Science Processing [ASP (25)].
Such flaring is not unusual for a BL Lac object and
would not have been followed up as extensively if
the neutrino had not been detected.

Figure 3 shows the Fermi-LAT light curve and
the detection time of the neutrino alert. The light
curve of TXS 0506+056 from August 2008 to
October 2017 was calculated in bins of 28 days for
the energy range above 0.1 GeV. An additional
light curve with 7-day bins was calculated for the
period around the time of the neutrino alert. The
γ-ray flux of TXS 0506+056 in each time bin was
determined through a simultaneous fit of this
source and the other Fermi-LAT sources in a
10° by 10° region of interest along with the
Galactic and isotropic diffuse backgrounds, using
a maximum-likelihood technique (25). The integrated
γ-ray flux of TXS 0506+056 for E > 0.1 GeV,
averaged over all Fermi-LAT observations spanning
9.5 years, is $(7.6 \pm 0.2) \times 10^{-8}$ cm$^{-2}$ s$^{-1}$. The
highest flux observed in a single 7-day light curve
bin was $(5.3 \pm 0.6) \times 10^{-7}$ cm$^{-2}$ s$^{-1}$, measured in
the week 4 to 11 July 2017. Strong flux variations
were observed during the γ-ray flare, the most prominent
being a flux increase from
$(7.9 \pm 2.9) \times 10^{-8}$ cm$^{-2}$ s$^{-1}$ in the week 8 to 15 August 2017
to $(4.0 \pm 0.5) \times 10^{-7}$ cm$^{-2}$ s$^{-1}$ in the week 15 to
22 August 2017. 
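
The size of the week-to-week jump can be made explicit by taking the ratio of the two weekly fluxes. The sketch below uses first-order error propagation and assumes the two measurements are uncorrelated with Gaussian uncertainties, which is only approximate here because the lower flux has a relative uncertainty of almost 40%. The function name is hypothetical.

```python
import math


def ratio_with_error(a: float, sigma_a: float, b: float, sigma_b: float) -> tuple[float, float]:
    """Return a/b and its first-order propagated uncertainty (uncorrelated inputs)."""
    ratio = a / b
    relative = math.sqrt((sigma_a / a) ** 2 + (sigma_b / b) ** 2)
    return ratio, ratio * relative


# Weekly Fermi-LAT fluxes quoted in the text, in photons cm^-2 s^-1.
FLUX_15_22_AUG = (4.0e-7, 0.5e-7)
FLUX_8_15_AUG = (7.9e-8, 2.9e-8)

flux_ratio, flux_ratio_err = ratio_with_error(*FLUX_15_22_AUG, *FLUX_8_15_AUG)

if __name__ == "__main__":
    print(f"flux ratio = {flux_ratio:.1f} +/- {flux_ratio_err:.1f}")
```

A ratio of about 5 matches the week-to-week variation described for the high-energy γ-ray band.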

The Astro-Rivelatore Gamma a Immagini Leggero
(AGILE) γ-ray telescope (31) confirmed the
elevated level of γ-ray emission at energies above
0.1 GeV from TXS 0506+056 in a 13-day window
(10 to 23 September 2017). The AGILE measured
flux of $(5.3 \pm 2.1) \times 10^{-7}$ cm$^{-2}$ s$^{-1}$ is consistent with
the Fermi-LAT observations in this time period.

High-energy γ-ray observations are shown in
Figs. 3 and 4. Details on the Fermi-LAT and AGILE 
analyses can be found in (25). 


Very-high-energy γ-ray observations of
TXS 0506+056


Following the announcement of IceCube-170922A,
TXS 0506+056 was observed by several ground-based
Imaging Atmospheric Cherenkov Telescopes
(IACTs). A total of 1.3 hours of observations
in the direction of the blazar TXS 0506+056
were taken using the High-Energy Stereoscopic
System (H.E.S.S.) (32), located in Namibia, on
23 September 2017 [Modified Julian Date (MJD)
58019], ~4 hours after the circulation of the neutrino
alert. A 1-hour follow-up observation of the
neutrino alert under partial cloud coverage was
performed using the Very Energetic Radiation
Imaging Telescope Array System (VERITAS) γ-ray
telescope array (33), located in Arizona, USA, later
on the same day, ~12 hours after the IceCube
detection. Both telescopes made additional observations
on subsequent nights, but neither detected
γ-ray emission from the source [see Fig. 3 and
(25)]. Upper limits at 95% CL on the γ-ray flux
were derived accordingly (assuming the measured
spectrum, see below): $7.5 \times 10^{-12}$ cm$^{-2}$ s$^{-1}$
during the H.E.S.S. observation period and
$1.2 \times 10^{-11}$ cm$^{-2}$ s$^{-1}$ during the VERITAS observations,
both for energies E > 175 GeV.

Fig. 3. Time-dependent multiwavelength observations of TXS
0506+056 before and after IceCube-170922A. Significant variability of
the electromagnetic emission can be observed in all displayed energy
bands, with the source being in a high-emission state around the
time of the neutrino alert. From top to bottom: (A) VHE γ-ray
observations by MAGIC, H.E.S.S., and VERITAS; (B) high-energy γ-ray
observations by Fermi-LAT and AGILE; (C and D) x-ray observations by
Swift XRT; (E) optical light curves from ASAS-SN, Kiso/KWFC, and
Kanata/HONIR; and (F) radio observations by OVRO and VLA. The red
dashed line marks the detection time of the neutrino IceCube-170922A.
The left set of panels shows measurements between MJD 54700
(22 August 2008) and MJD 58002 (6 September 2017). The set of
panels on the right shows an expanded scale for time range
MJD 58002 to MJD 58050 (24 October 2017). The Fermi-LAT light
curve is binned in 28-day bins on the left panel, while finer 7-day bins
are used on the expanded panel. A VERITAS limit from MJD 58019.40
(23 September 2017) of $2.1 \times 10^{-10}$ cm$^{-2}$ s$^{-1}$ is off the scale of the plot
and not shown.
[Horizontal axis: Modified Julian Date.]

The Major Atmospheric Gamma Imaging 
Cherenkov (MAGIC) Telescopes (34) observed 
TXS 0506+056 for 2 hours on 24 September 2017 
(MJD 58020) under nonoptimal weather con- 
ditions and then for a period of 13 hours from 
28 September to 4 October 2017 (MJD 58024- 
58030) under good conditions. MAGIC consists 
of two 17-m telescopes, located at the Roque de 
los Muchachos Observatory on the Canary 
Island of La Palma (Spain). 
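
The observation dates above are given both as calendar dates and as Modified Julian Dates. A minimal sketch of the conversion follows, using the standard definition that MJD 0 corresponds to 17 November 1858 (00:00 UTC). The helper names are hypothetical and only whole days are handled.

```python
from datetime import date, timedelta

# MJD 0 is defined as 1858-11-17 (midnight UTC).
MJD_EPOCH = date(1858, 11, 17)


def mjd_to_date(mjd: int) -> date:
    """Calendar date on which the given integer MJD begins."""
    return MJD_EPOCH + timedelta(days=mjd)


def date_to_mjd(day: date) -> int:
    """Integer MJD at the start of the given calendar date."""
    return (day - MJD_EPOCH).days


if __name__ == "__main__":
    for mjd in (58019, 58020, 58024, 58030):
        print(mjd, mjd_to_date(mjd).isoformat())
```

Applied to the MJD values quoted for the H.E.S.S. and MAGIC observations, this returns the calendar dates given in the text.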

No γ-ray emission from TXS 0506+056 was
detected in the initial MAGIC observations on
24 September 2017, and an upper limit was derived
on the flux above 90 GeV of $3.6 \times 10^{-11}$ cm$^{-2}$ s$^{-1}$
at 95% CL (assuming a spectrum $dN/dE \propto E^{-3.9}$).
However, prompted by the Fermi-LAT detection
of enhanced γ-ray emission, MAGIC performed
another 13 hours of observations of the region
starting 28 September 2017. Integrating the data,
MAGIC detected a significant very-high-energy
(VHE) γ-ray signal (35) corresponding to 374 ±
62 excess photons, with observed energies up to
about 400 GeV. This represents a 6.2σ excess over
expected background levels (25). The day-by-day
light curve of TXS 0506+056 for energies above
90 GeV is shown in Fig. 3. The probability that a
constant flux is consistent with the data is less
than 1.35%. The measured differential photon
spectrum (Fig. 4) can be described over the energy
range of 80 to 400 GeV by a simple power law,
$dN/dE \propto E^{\gamma}$, with a spectral index γ = −3.9 ± 0.4
and a flux normalization of $(2.0 \pm 0.4) \times 10^{-10}$
TeV$^{-1}$ cm$^{-2}$ s$^{-1}$ at E = 130 GeV. Uncertainties are
statistical only. The estimated systematic uncertainties
are <15% in the energy scale, 11 to 18% in
the flux normalization, and ±0.15 for the power-law
slope of the energy spectrum (34). Further
observations after 4 October 2017 were prevented
by the full Moon.

An upper limit to the redshift of TXS 0506+056
can be inferred from VHE γ-ray observations
using limits on the attenuation of the VHE flux
due to interaction with the EBL. Details on the
method are available in (25). The obtained upper
limit ranges from 0.61 to 0.98 at a 95% CL, depending
on the EBL model used. These upper
limits are consistent with the measured redshift
of z = 0.3365 (28).

No γ-ray source above 1 TeV at the location of
TXS 0506+056 was found in survey data of the
High Altitude Water Cherenkov (HAWC) γ-ray
observatory (36), either close to the time of the
neutrino alert or in archival data taken since
November 2014 (25).

VHE γ-ray observations are shown in Figs. 3
and 4. All measurements are consistent with the
observed flux from MAGIC, considering the differences
in exposure, energy range, and observation
periods.


Radio, optical, and x-ray observations


The Karl G. Jansky Very Large Array (VLA) (37)
observed TXS 0506+056 starting 2 weeks after
the alert in several radio bands from 2 to 12 GHz
(38), detecting significant radio flux variability
and some spectral variability of this source. The
source is also in the long-term blazar monitoring
program of the Owens Valley Radio Observatory
(OVRO) 40-m telescope at 15 GHz (39). The light
curve shows a gradual increase in radio emission
during the 18 months preceding the neutrino alert.

Optical observations were performed by
the All-Sky Automated Survey for Supernovae
(ASAS-SN) (40), the Liverpool Telescope (41), the
Kanata Telescope (42), the Kiso Schmidt Telescope
(43), the high-resolution spectrograph (HRS)
on the Southern African Large Telescope (SALT)
(44), the Subaru telescope Faint Object Camera
and Spectrograph (FOCAS) (45), and the X-SHOOTER
instrument on the Very Large Telescope
(VLT) (46). The V band flux of the source is
the highest observed in recent years, and the
spectral energy distribution has shifted toward
blue wavelengths. Polarization was detected by
Kanata in the R band at the level of 7%. Redshift
determination for BL Lac objects is difficult owing
to the nonthermal continuum from the nucleus
outshining the spectral lines from the host galaxies.
Attempts were made using optical spectra from
the Liverpool, Subaru, and VLT telescopes to
measure the redshift of TXS 0506+056, but only
limits could be derived [see, e.g., (47)]. The redshift
of TXS 0506+056 was later determined to be
z = 0.3365 ± 0.0010 using the Gran Telescopio
Canarias (28).

Fig. 4. Broadband spectral energy distribution for the blazar
TXS 0506+056. The SED is based on observations obtained
within 14 days of the detection of the IceCube-170922A event. The
$E^2 dN/dE$ vertical axis is equivalent to a $\nu F_\nu$ scale. Contributions are
provided by the following instruments: VLA (38), OVRO (39), Kanata
Hiroshima Optical and Near-InfraRed camera (HONIR) (52), Kiso, and
the Kiso Wide Field Camera (KWFC) (43), Southeastern Association for
Research in Astronomy Observatory (SARA/UA) (53), ASAS-SN (54),
Swift Ultraviolet and Optical Telescope (UVOT) and XRT (55),
NuSTAR (56), INTEGRAL (57), AGILE (58), Fermi-LAT (16),
MAGIC (35), VERITAS (59), H.E.S.S. (60), and HAWC (61). Specific
observation dates and times are provided in (25). Differential flux
upper limits (shown as colored bands and indicated as “UL” in the legend)
are quoted at the 95% CL, while markers indicate significant detections.
Archival observations are shown in gray to illustrate the historical flux
level of the blazar in the radio-to-keV range as retrieved from the ASDC
SED Builder (62), and in the γ-ray band as listed in the Fermi-LAT 3FGL
catalog (23) and from an analysis of 2.5 years of HAWC data. The γ-ray
observations have not been corrected for absorption owing to the EBL.
SARA/UA, ASAS-SN, and Kiso/KWFC observations have not been corrected
for Galactic attenuation. The electromagnetic SED displays a double-bump
structure, one peaking in the optical-ultraviolet range and the second one
in the GeV range, which is characteristic of the nonthermal emission from
blazars. Even within this 14-day period, there is variability observed in
several of the energy bands shown (see Fig. 3), and the data are not all
obtained simultaneously. Representative $\nu_\mu + \bar{\nu}_\mu$ neutrino flux upper
limits that produce on average one detection like IceCube-170922A over a
period of 0.5 (solid black line) and 7.5 years (dashed black line) are shown,
assuming a spectrum of $dN/dE \propto E^{-2}$ at the most probable neutrino
[Horizontal axis: log(Frequency [Hz]).]

X-ray observations were made by the X-Ray
Telescope (XRT) on the Neil Gehrels Swift
Observatory (0.3 to 10 keV) (48), MAXI Gas Slit
Camera (GSC) (2 to 10 keV) (49), Nuclear Spectroscopic
Telescope Array (NuSTAR) (3 to 79 keV)
(50), and the INTernational Gamma-Ray Astrophysics
Laboratory (INTEGRAL) (20 to 250 keV)
(51), with detections by Swift and NuSTAR. In a
2.1 square degree region around the neutrino
alert, Swift identified nine x-ray sources, including
TXS 0506+056.

Swift monitored the x-ray flux from TXS 
0506+056 for 4 weeks after the alert, starting 
23 September 2017 00:09:16 UT, finding clear 
evidence for spectral variability (see Fig. 3D). The 
strong increase in flux observed at VHE energies
over several days up until MJD 58030 (4 October
2017) correlates well with an increase in the x-ray
emission during this period of time. The spectrum
of TXS 0506+056 observed in the week
after the flare is compatible with the sum of two
power-law spectra, a soft spectrum with index
−2.8 ± 0.3 in the soft x-ray band covered by Swift
XRT, and a hard spectrum with index −1.4 ± 0.3
in the hard x-ray band covered by NuSTAR (25).
Extrapolated to 20 MeV, the NuSTAR hard-spectrum
component connects smoothly to the
plateau (index −2) component observed by the
Fermi-LAT between 0.1 and 100 GeV and the soft
VHE γ-ray component observed by MAGIC (compare
Fig. 4). Taken together, these observations
provide a mostly complete, contemporaneous 
picture of the source emissions from 0.3 keV to 
400 GeV, more than nine orders of magnitude in 
photon energy. 
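
The span of "more than nine orders of magnitude" follows directly from the two energy limits, and the same photon energies can be mapped onto the log(Frequency [Hz]) axis used for the SED through ν = E/h. The sketch below is illustrative only; the function names are hypothetical, and the Planck constant is the exact SI value expressed in eV s.

```python
import math

# Planck constant in eV s (exact in the 2019 SI definition).
PLANCK_EV_S = 4.135667696e-15

E_MIN_EV = 0.3e3   # 0.3 keV, lower edge of the Swift XRT band
E_MAX_EV = 400e9   # 400 GeV, highest energies observed by MAGIC


def decades_spanned(e_low_ev: float, e_high_ev: float) -> float:
    """Number of orders of magnitude between two photon energies."""
    return math.log10(e_high_ev / e_low_ev)


def log10_frequency_hz(energy_ev: float) -> float:
    """log10 of the photon frequency in Hz for a photon energy in eV."""
    return math.log10(energy_ev / PLANCK_EV_S)


if __name__ == "__main__":
    print(f"decades spanned: {decades_spanned(E_MIN_EV, E_MAX_EV):.2f}")
    print(f"log10(nu/Hz) at 0.3 keV: {log10_frequency_hz(E_MIN_EV):.2f}")
    print(f"log10(nu/Hz) at 400 GeV: {log10_frequency_hz(E_MAX_EV):.2f}")
```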

Figures 3 and 4 summarize the multiwave- 
length light curves and the changes in the broad- 
band spectral energy distribution (SED), compared 
to archival observations. Additional details about 
the radio, optical, and x-ray observations can be 
found in (25). 


Chance coincidence probability 


Data obtained from multiwavelength observations
of TXS 0506+056 can be used to constrain
the blazar-neutrino chance coincidence probability.
This coincidence probability is a measure of
the likelihood that a neutrino alert like IceCube-170922A
is correlated by chance with a flaring
blazar, considering the large number of known
γ-ray sources and the modest number of neutrino
alerts.

The IceCube Collaboration et al., Science 361, eaat1378 (2018)

Given the large number of potential neutrino 
source classes available, no a priori blazar-neutrino 
coincidence model was selected ahead of the 
alert. After the observation, however, several cor- 
relation scenarios were considered and tested 
to quantify the a posteriori significance of the 
observed coincidence. Testing multiple models 
is important as the specific assumptions about
the correlation between neutrinos and γ-rays have
an impact on the chance coincidence probability.
In each case, the probability to obtain, by chance,
a degree of correlation at least as high as that observed
for IceCube-170922A was calculated using
simulated neutrino alerts and the light curves of
Fermi-LAT γ-ray sources. Given the continuous
all-sky monitoring of the Fermi-LAT since 2008,
all tests utilized 28-day binned γ-ray light curves
above 1 GeV from 2257 extragalactic Fermi-LAT
sources, derived in the same manner as used for
the analysis of TXS 0506+056 γ-ray data.

To calculate the chance probabilities, a like- 
lihood ratio test is used that allows different 
models of blazar-neutrino flux correlation to be 
evaluated in a consistent manner. All models assume
that at least some of the observed γ-ray flux
is produced in the same hadronic interactions
that would produce high-energy neutrinos within
the source. Our first model assumes that the
neutrino flux is linearly correlated with the high-energy
γ-ray energy flux (4). In this scenario,
neutrinos are more likely to be produced during
periods of bright, hard γ-ray emission. In the
second model, the neutrino flux is modeled as
strongly tied to variations in the observed γ-ray
flux, regardless of the average flux of γ-rays. Here,
a weak or a strong γ-ray source is equally likely to
be a neutrino source if the neutrino is temporally
correlated with variability in the γ-ray light curve.
Third, we consider a correlation of the neutrino
flux with the VHE γ-ray flux. Because hadronic
acceleration up to a few PeV is required to explain
the detected neutrino energy, VHE γ-ray sources
are potential progenitors. Full details and results
from these analyses are presented in (25). 

The neutrino IceCube-170922A was found to
arrive in a period of flaring activity in high-energy
γ-rays. Prior to IceCube-170922A, nine public
alerts had been issued by the IceCube real-time
system. Additionally, 41 archival events have been
identified among the IceCube data recorded since
2010, before the start of the real-time program
in April 2016, which would have caused alerts
if the real-time alert system had been in place.
These events were also tested for coincidence
with the γ-ray data.

Chance coincidence of the neutrino with the
flare of TXS 0506+056 is disfavored at the 3σ
level in any scenario where neutrino production
is linearly correlated with γ-ray production or
with γ-ray flux variations. This includes look-elsewhere
corrections for all 10 alerts issued
previously by IceCube and the 41 archival events.
One of the neutrino events that would have been
sent as an alert and had a good angular resolution
(<5°) is in a spatial correlation with the
γ-ray blazar 3FGL J1040.4+0615. However, this
source was not in a particularly bright emission 
state at the detection time of the corresponding 
neutrino. Therefore, a substantially lower test 
statistic would be obtained in the chance cor- 
relation tests defined in this paper (25). 
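
To illustrate how a look-elsewhere correction over the 10 alerts and 41 archival events affects a significance, the sketch below converts between Gaussian significance and p-value and applies a trials factor. It is not the analysis of (25): the one-sided Gaussian convention, the treatment of the 51 events as independent trials, and the pre-trial p-value used in the example are all assumptions chosen for illustration, and the function names are hypothetical.

```python
import math
from statistics import NormalDist

_STANDARD_NORMAL = NormalDist()


def sigma_to_p(sigma: float) -> float:
    """One-sided Gaussian tail probability for a given significance."""
    return 0.5 * math.erfc(sigma / math.sqrt(2.0))


def p_to_sigma(p: float) -> float:
    """Inverse of sigma_to_p (one-sided convention), valid for 0 < p < 1."""
    return -_STANDARD_NORMAL.inv_cdf(p)


def trials_corrected_p(p_single: float, n_trials: int) -> float:
    """Probability of at least one chance result among n independent trials."""
    # Computes 1 - (1 - p)^n in a numerically stable way for small p.
    return -math.expm1(n_trials * math.log1p(-p_single))


# 10 public alerts plus 41 archival events, as stated in the text.
N_TRIALS = 10 + 41

# Hypothetical pre-trial p-value, chosen only to illustrate the arithmetic.
P_PRE_TRIAL_EXAMPLE = 2.0e-5

if __name__ == "__main__":
    p_post = trials_corrected_p(P_PRE_TRIAL_EXAMPLE, N_TRIALS)
    print(f"p(3 sigma, one-sided) = {sigma_to_p(3.0):.2e}")
    print(f"post-trial p = {p_post:.2e} -> {p_to_sigma(p_post):.2f} sigma")
```

For small p-values the correction is close to multiplying the pre-trial p-value by the number of trials.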

We have investigated how typical the blazar 
TXS 0506+056 is among those blazars that might 
have given rise to a coincident observation sim- 
ilar to the one reported here. A simulation that 
assumes that the neutrino flux is linearly correlated
with the blazar γ-ray energy flux shows
that in 14% of the signal realizations, we would
find a neutrino coincident with a similarly bright
γ-ray state as that observed for TXS 0506+056
(25). The detection of a single neutrino does not
allow us to probe the details of neutrino production
models or measure the neutrino-to-γ-ray
production ratio. Further observations will be
needed to unambiguously establish a correlation 
between high-energy neutrinos and blazars, as 
well as to understand the emission and acceler- 
ation mechanism in the event of a correlation. 


Discussion 


Blazars have often been suggested as potential 
sources of high-energy neutrinos. The calorimetric 
high-energy output of certain candidate blazars is 
high enough to explain individual observed IceCube 
events at 100-TeV to 1-PeV energies (63). Spatial 
coincidences between catalogs of blazars and 
neutrinos have been examined in (64), while (65) 
investigated one shower-like event with several 
thousand square degrees angular uncertainty ob- 
served in time coincidence with a blazar outburst. 
A track-like event, IceCube-160731, has been previously
connected to a flaring γ-ray source (66).
However, the limited evidence for a flaring source 
in the multiwavelength coverage did not permit 
an identification of the source type of the poten- 
tial counterpart (66). 

Owing to the precise direction of IceCube-170922A,
combined with extensive multiwavelength
observations, a chance correlation between
a high-energy neutrino and the potential counterpart
can be rejected at the 3σ level. Considering
the association between IceCube-170922A
and TXS 0506+056, γ-ray blazars are strong candidate
sources for at least a fraction of the observed
astrophysical neutrinos. Earlier studies of
the cross-correlation between IceCube events and
the γ-ray blazar population observed by Fermi-LAT
demonstrated that these blazars can only produce
a fraction of the observed astrophysical
neutrino flux above 10 TeV (4). Although these 
limits constrain the contribution from blazars to 
the diffuse neutrino background, the potential 
association of one or two high-energy neutrinos 
to blazars over the total observing time of IceCube 
is fully compatible with the constraint. 

Adopting standard cosmological parameters
(67) $H_0 = 67.8$ km s$^{-1}$ Mpc$^{-1}$, $\Omega_m = 0.308$, $\Omega_\Lambda = 0.692$, where
$H_0$ is the Hubble constant, $\Omega_m$ is the matter
density, and $\Omega_\Lambda$ is the dark energy density, the
observed redshift of z = 0.3365 implies an isotropic
γ-ray luminosity between 0.1 and 100 GeV of
$1.3 \times 10^{47}$ erg s$^{-1}$ in the ±2 weeks around the arrival
time of the IceCube neutrino, and a luminosity
of $2.8 \times 10^{46}$ erg s$^{-1}$, averaged over all Fermi-LAT
observations. Observations in the optical, x-ray,
and VHE γ-ray bands show typical characteristics
of blazar flares: strong variability on time
scales of a few days and an indication of a shift
of the synchrotron emission peak toward higher
frequencies. VHE γ-ray emission is found to change
by a factor of ~4 within just 3 days. Similarly,
the high-energy γ-ray energy band shows flux
variations up to a factor of ~5 from one week to
the next.

No other neutrino event that would have 
passed the selection criteria for a high-energy 
alert was observed from this source since the 
start of IceCube observations in May 2010. The 
muon neutrino fluence for which we would expect 
to detect one high-energy alert event with IceCube 
in this period of time is $2.8 \times 10^{-3}$ erg cm$^{-2}$. A
power-law neutrino spectrum is assumed in this 
calculation with an index of -2 between 200 TeV 
and 7.5 PeV, the range between the 90% CL lower 
and upper limits for the energy of the observed 
neutrino [see (25) for details]. 

The fluence can be expressed as an integrated
energy flux if we assume a time period during
which the source was emitting neutrinos. For a
source that emits neutrinos only during the
~6-month period corresponding to the duration
of the high-energy γ-ray flare, the corresponding
average integrated muon neutrino energy flux
would be $1.8 \times 10^{-10}$ erg cm$^{-2}$ s$^{-1}$. Alternatively,
the average integrated energy flux of a source
that emits neutrinos over the whole observation
period of IceCube (i.e., 7.5 years) would be
$1.2 \times 10^{-11}$ erg cm$^{-2}$ s$^{-1}$. These two benchmark cases
are displayed in Fig. 4. In an ensemble of faint
sources with a summed expectation of order 1,
we would anticipate observing a neutrino even
if the individual expectation value is ≪1. This is
expressed by the downward arrows on the neu- 
trino flux points in Fig. 4. 
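
As a rough check of the two benchmark fluxes, the fluence can simply be divided by the assumed emission time. The sketch below does this in Python. The function name, the use of a Julian year, and the treatment of the ~6-month flare as exactly half a year are assumptions made for illustration, not part of the original analysis.

```python
# Convert a fluence into an average energy flux for an assumed emission time.
SECONDS_PER_YEAR = 365.25 * 86400.0  # Julian year (an assumption)


def average_energy_flux(fluence_erg_cm2: float, duration_yr: float) -> float:
    """Average flux (erg cm^-2 s^-1) if the fluence is spread evenly in time."""
    return fluence_erg_cm2 / (duration_yr * SECONDS_PER_YEAR)


FLUENCE = 2.8e-3  # erg cm^-2, the muon neutrino fluence quoted in the text

flare_flux = average_energy_flux(FLUENCE, 0.5)  # ~6-month gamma-ray flare
full_flux = average_energy_flux(FLUENCE, 7.5)   # whole observation period

print(f"flare-only emission: {flare_flux:.1e} erg cm^-2 s^-1")
print(f"steady emission:     {full_flux:.1e} erg cm^-2 s^-1")
```

Under these assumptions both results round to the values quoted in the text.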

The two cases discussed above correspond to 
average isotropic muon neutrino luminosities of 
$7.2 \times 10^{46}$ erg s$^{-1}$ for a source that was emitting
neutrinos in the ~6-month period of the high-
energy γ-ray flare, and $4.8 \times 10^{45}$ erg s$^{-1}$ for a
source that emitted neutrinos throughout the
whole observation period. This is similar to the
luminosity observed in γ-rays and thus broadly
consistent with hadronic source scenarios (68). 
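
The step from an observed energy flux to an isotropic luminosity goes through the luminosity distance, $L = 4\pi D_L^2 F$. The following is a minimal sketch of that conversion for a flat ΛCDM cosmology with the parameters quoted earlier. The function names, the midpoint-rule integration, and the neglect of any K-correction (which is unity for an $E^{-2}$ energy spectrum) are assumptions of this sketch, and $H_0$ is taken to be in km s$^{-1}$ Mpc$^{-1}$.

```python
import math

C_KM_S = 299792.458   # speed of light, km/s
MPC_CM = 3.0857e24    # one megaparsec in cm
H0, OMEGA_M, OMEGA_L = 67.8, 0.308, 0.692  # flat LCDM parameters from the text


def luminosity_distance_cm(z: float, steps: int = 10000) -> float:
    """Luminosity distance in cm via midpoint integration of 1/E(z)."""
    dz = z / steps
    integral = 0.0
    for i in range(steps):
        zi = (i + 0.5) * dz
        integral += dz / math.sqrt(OMEGA_M * (1.0 + zi) ** 3 + OMEGA_L)
    comoving_mpc = (C_KM_S / H0) * integral
    return (1.0 + z) * comoving_mpc * MPC_CM


def isotropic_luminosity(flux_erg_cm2_s: float, z: float) -> float:
    """Isotropic-equivalent luminosity (erg/s) for an observed energy flux."""
    return 4.0 * math.pi * luminosity_distance_cm(z) ** 2 * flux_erg_cm2_s


Z_SOURCE = 0.3365
lum_flare = isotropic_luminosity(1.8e-10, Z_SOURCE)   # flare-only benchmark
lum_steady = isotropic_luminosity(1.2e-11, Z_SOURCE)  # steady benchmark
print(f"{lum_flare:.1e} erg/s, {lum_steady:.1e} erg/s")
```

With these inputs the luminosity distance comes out near 1.8 Gpc, and the two benchmark fluxes map onto luminosities close to the quoted ones.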

A neutrino flux that produces a high-energy 
alert event can, over time, produce many lower- 
energy neutrino-induced muons in IceCube. A 
study of neutrino emission from TXS 0506+056 
prior to the high-energy γ-ray flare, based on the
investigation of these lower-energy events, is re-
ported in a companion paper (26).

NEUTRINO ASTROPHYSICS


Neutrino emission from the direction 
of the blazar TXS 0506+056 prior to 
the IceCube-170922A alert 


IceCube Collaboration*†


A high-energy neutrino event detected by IceCube on 22 September 2017 was coincident in 
direction and time with a gamma-ray flare from the blazar TXS 0506+056. Prompted by 
this association, we investigated 9.5 years of IceCube neutrino observations to search for 
excess emission at the position of the blazar. We found an excess of high-energy neutrino 
events, with respect to atmospheric backgrounds, at that position between September 2014 
and March 2015. Allowing for time-variable flux, this constitutes 3.5σ evidence for neutrino
emission from the direction of TXS 0506+056, independent of and prior to the 2017 flaring
episode. This suggests that blazars are identifiable sources of the high-energy astrophysical
neutrino flux.


The origin of the highest-energy cosmic rays
is believed to be extragalactic (1), but their
acceleration sites remain unidentified. High- 
energy neutrinos are expected to be pro- 
duced in or near the acceleration sites when 
cosmic rays interact with matter and ambient 
light, producing charged mesons that decay into 
neutrinos and other particles. Unlike cosmic rays, 
neutrinos can travel through the Universe un- 
impeded by interactions with other particles and 
undeflected by magnetic fields, providing a means 
to identify and study the extreme environments 
producing cosmic rays (2). Blazars, a class of active 
galactic nuclei with powerful relativistic jets 
pointed close to our line of sight (3), are prom- 
inent candidate sources of such high-energy 
neutrino emission (4-9). The electromagnetic 
emission of blazars is observed to be highly var-
iable on time scales from minutes to years (10).

The IceCube Neutrino Observatory (11) is a
high-energy neutrino detector occupying an in-
strumented volume of 1 km$^3$ within the Antarctic
ice sheet at the Amundsen-Scott South Pole Sta- 
tion. The detector consists of an array of 86 
vertical strings, nominally spaced 125 m apart 
and descending to a depth of approximately 
2450 m in the ice. The bottom 1 km of each string 
is equipped with 60 optical sensors that record 
Cherenkov light emitted by relativistic charged 
particles passing through the optically transpar- 
ent ice. When high-energy muon neutrinos in- 
teract with the ice, they can create relativistic 
muons that travel many kilometers, creating a 
track-like series of Cherenkov photons recorded 
when they pass through the array. This allows the 
reconstruction of the original neutrino direction
with a median angular uncertainty of 0.5° for a
neutrino energy of ~30 TeV (or 0.3° at 1 PeV)
(12, 13).

*IceCube Collaboration authors and affiliations are listed in the
supplementary materials.
†Email: analysis@icecube.wisc.edu

IceCube discovered the existence of a diffuse 
flux of high-energy astrophysical neutrinos in 
2013 (14, 15). Measurements of the energy spec-
trum have since been refined (16, 17), indicating
that the neutrino spectrum extends above several
PeV. However, analyses of neutrino observations
have not succeeded in identifying individual
sources of high-energy neutrinos (12, 18). This
suggests that the sources are distributed across 
the sky and that even the brightest individual 
sources contribute only a small fraction of the 
total observed flux. 

Recently, the detection of a high-energy neutri- 
no by IceCube, together with observations in 
gamma rays and at other wavelengths, indicates 
that a blazar, TXS 0506+056, located at right ascen- 
sion (RA) 77.3582° and declination (Dec) +5.69314° 
(J2000 equinox) (19) may be an individually iden- 
tifiable source of high-energy neutrinos (20). The 
neutrino-candidate event, IceCube-170922A, was 
detected on 22 September 2017, selected by the 
Extremely High Energy (EHE) online event filter 
(21), and reported as a public alert (22). EHE 
alerts are currently sent at a rate of about four 
per year, and are based on well-reconstructed, 
high-energy muon-track events. The selection 
threshold is set so that approximately half of 
the events are estimated to be astrophysical neu- 
trinos, the rest being atmospheric background 
events. After the alert was sent, further studies 
refined the directional reconstruction, with best-
fitting coordinates of RA $77.43^{+0.95}_{-0.65}$ and Dec
$+5.72^{+0.50}_{-0.30}$ (degrees, J2000, 90% containment
region). The most probable neutrino energy was 
estimated to be 290 TeV, with a 90% confidence 
level lower limit of 183 TeV (20). 

It was soon determined that the direction of 
IceCube-170922A was consistent with the loca-
tion of TXS 0506+056 and coincident with a
state of enhanced gamma-ray activity observed 
since April 2017 (23) by the Large Area Telescope 
(LAT) on the Fermi Gamma-ray Space Telescope 
(24). Follow-up observations of the blazar led to 
the detection of gamma rays with energies up to 
400 GeV by the Major Atmospheric Gamma 
Imaging Cherenkov (MAGIC) Telescopes (25, 26). 
IceCube-170922A and the electromagnetic obser- 
vations are described in detail in (20). The sig- 
nificance of the spatial and temporal coincidence 
of the high-energy neutrino and the blazar flare 
is estimated to be at the 3σ level (20). On the
basis of this result, we consider the hypothesis 
that the blazar TXS 0506+056 has been a source 
of high-energy neutrinos beyond that single event. 


Searching for neutrino emission 


IceCube monitors the whole sky and has main- 
tained essentially continuous observations since 
5 April 2008. Searches for neutrino point sources 
using two model-independent methods, a time- 
integrated and a time-dependent unbinned max- 
imum likelihood analysis, have previously been 
published for the data collected between 2008 
and 2015 (12, 18, 27). Here, we analyze the same 
7-year data sample supplemented with additional 
data collected from May 2015 until October 2017 
(21). The data span 9.5 years and consist of six 
distinct periods, corresponding to changing detec- 
tor configurations, data-taking conditions, and 
improved event selections (Table 1). 

The northern sky, where TXS 0506+056 is 
located, is observed through Earth by IceCube. 
Approximately 70,000 neutrino-induced muon 
tracks are recorded each year from this hemi- 
sphere of the sky after passing the final event 
selection criteria. Fewer than 1% of these events 
originate from astrophysical neutrinos; the vast 
majority are background events caused by neu- 
trinos of median energy ~1 TeV created in cosmic 
ray interactions in the atmosphere over other 
locations on Earth. However, for an astrophysical 
muon-neutrino flux where the differential num-
ber of neutrinos with energy $E$ scales as $dN/dE \sim E^{-2}$,
the distribution of muon energies is different
than for the background atmospheric neutrino
flux, which scales as $\sim E^{-3.7}$ (17). This allows for
further discriminating power in point source 
searches besides directional-only excesses. 

A high-significance point source detection 
(12, 18) can require as few as two or three, or as 
many as 30, signal events to stand out from the 
background, depending on the energy spectrum 
and the clustering of events in time. To search 
for a neutrino signal at the coordinates of TXS 
0506+056, we apply the standard time-integrated 
analysis (28) and time-dependent analysis (29) 
that have been used in past searches (12, 18, 27). 
The time-integrated analysis uses an unbinned 
maximum likelihood ratio method to search for 
an excess number of events consistent with a 
point source at a specified location, given the 
angular distance and angular uncertainty of each 
event. Energy information is included in the def-
inition of the likelihood, assuming a power-law
energy spectrum $E^{-\gamma}$, with the spectral index $\gamma$
as a fitted parameter. The model parameters are
correlated and are expressed as a pair, $(\Phi_{100}, \gamma)$,
where $\Phi_{100}$ is the flux normalization at 100 TeV.
The time-dependent analysis uses the same for-
mulation of the likelihood but searches for
clustering in time as well as space by introducing
an additional time profile. It is performed sep-
arately for two different generic profile shapes: a
Gaussian-shaped time window and a box-shaped
time window. Each analysis varies the central
time of the window, $T_0$, and the duration $T_W$
(from seconds to years) of the potential signal to
find the four parameters $(\Phi_{100}, \gamma, T_0, T_W)$ that
maximize the likelihood ratio, which is defined
as the test statistic TS. (For the Gaussian time
window, $T_W$ represents twice the standard de-
viation.) The test statistic includes a factor that
corrects for the look-elsewhere effect arising 
from all of the possible time windows that could 
be chosen (30). 
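
To make the structure of such a fit concrete, the sketch below implements a heavily simplified unbinned likelihood ratio in Python. It keeps only the spatial term, using the common mixture form $L(n_s) = \prod_i [\frac{n_s}{N} S_i + (1 - \frac{n_s}{N}) B_i]$, and omits the energy and time terms that the analysis described above includes. The flat-sky Gaussian signal PDF, the uniform background, the grid scan over $n_s$, and all names and toy numbers are assumptions for illustration only.

```python
import numpy as np


def signal_pdf(ang_dist_deg, sigma_deg):
    """2D Gaussian spatial PDF (per square degree), flat-sky approximation."""
    norm = 2.0 * np.pi * sigma_deg ** 2
    return np.exp(-0.5 * (ang_dist_deg / sigma_deg) ** 2) / norm


def fit_ns(ang_dist_deg, sigma_deg, bkg_pdf):
    """Scan n_s on a grid; return (TS, best n_s) with TS = 2 ln[L(n_s)/L(0)]."""
    n = len(ang_dist_deg)
    ratio = signal_pdf(ang_dist_deg, sigma_deg) / bkg_pdf - 1.0
    ns_grid = np.linspace(0.0, 0.5 * n, 2001)
    # ln L(n_s) - ln L(0) = sum_i ln(1 + (n_s / N) * (S_i / B_i - 1))
    llr = np.array([np.log1p(ns / n * ratio).sum() for ns in ns_grid])
    best = int(np.argmax(llr))
    return 2.0 * llr[best], ns_grid[best]


# Toy data: uniform background in a 5-degree disk plus 30 injected signal events.
rng = np.random.default_rng(1)
RADIUS, SIGMA = 5.0, 0.5                               # degrees (toy values)
bkg_pdf = 1.0 / (np.pi * RADIUS ** 2)                  # uniform over the disk
background = RADIUS * np.sqrt(rng.uniform(size=300))   # angular distances
signal = rng.rayleigh(scale=SIGMA, size=30)            # Gaussian point spread

ts_bkg, ns_bkg = fit_ns(background, SIGMA, bkg_pdf)
ts_sig, ns_sig = fit_ns(np.concatenate([background, signal]), SIGMA, bkg_pdf)
print(f"background only: TS = {ts_bkg:.1f}, n_s = {ns_bkg:.1f}")
print(f"with signal:     TS = {ts_sig:.1f}, n_s = {ns_sig:.1f}")
```

A time-dependent version would multiply each signal term by a Gaussian or box-shaped time PDF and also fit $T_0$ and $T_W$; that extension is not shown here.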

For each analysis method (time-integrated and 
time-dependent), a robust significance estimate is 
obtained by performing the identical analysis on 
trials with randomized datasets. These are produced 
by randomizing the event times and recalculating 


Table 1. IceCube neutrino data samples. 
Six data-taking periods make up the full 
9.5-year data sample. Sample numbers 
correspond to the number of detector 
strings that were operational. During the 
first three periods, the detector was still 
under construction. The last three periods 
correspond to different data-taking 
conditions and/or event selections with the 
full 86-string detector. 


Sample   Start          End
IC40     5 April 2008   20 May 2009
IC59     20 May 2009    31 May 2010
IC79     31 May 2010    13 May 2011
IC86a    13 May 2011    16 May 2012
IC86b    16 May 2012    18 May 2015
IC86c    18 May 2015    31 October 2017

Fig. 1. Time-dependent analysis results. The orange curve corresponds
to the analysis using the Gaussian-shaped time profile. The central time $T_0$
and width $T_W$ are plotted for the most significant excess found in each
period, with the P value of that result indicated by the height of the peak.
The blue curve corresponds to the analysis using the box-shaped time
profile. The curve traces the outer edge of the superposition of the best-

the RA coordinates within each data-taking period. 
The resultant P value is defined as the fraction of 
randomized trials yielding a value of TS greater than 
or equal to the one obtained for the actual data. 
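
The P value definition above translates directly into a counting operation. The snippet below is a minimal sketch of that step only; the function name and the toy list of trial TS values are invented for illustration, and producing real trials would require rerunning the full fit on each randomized dataset.

```python
def p_value(ts_observed, ts_trials):
    """Fraction of randomized trials with TS >= the observed TS."""
    exceed = sum(1 for ts in ts_trials if ts >= ts_observed)
    return exceed / len(ts_trials)


# Toy TS values standing in for fits to randomized datasets (hypothetical).
toy_trials = [0.0, 0.3, 1.2, 2.5, 0.0, 4.1, 0.7, 0.0, 9.8, 1.9]
print(p_value(4.1, toy_trials))  # 2 of 10 trials reach TS >= 4.1, so 0.2
```

Because the smallest resolvable nonzero P value is one over the number of trials, P values of order $10^{-5}$ require at least of order $10^5$ randomized datasets.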

Because the detector configuration and event 
selections changed as shown in Table 1, the time- 
dependent analysis is performed by operating on 
each data-taking period separately. (A flare that 
spans a boundary between two periods could be 
partially detected in either period, but with re- 
duced significance.) An additional look-elsewhere 
correction then needs to be applied for a result in 
an individual data segment, given by the ratio of 
the total 9.5-year observation time to the obser- 
vation time of that data segment (30). 


Neutrinos from the direction of 
TXS 0506+056 


The results of the time-dependent analysis per- 
formed at the coordinates of TXS 0506+056 are 
shown in Fig. 1 for each of the six data periods. 
One of the data periods, IC86b from 2012 to 2015, 
contains a significant excess, which is identified 
by both time-window shapes. The excess consists 
of 13 ± 5 events above the expectation from the
atmospheric background. The significance depends 
on the energies of the events, their proximity to 
the coordinates of TXS 0506+056, and their 
clustering in time. This is illustrated in Fig. 2, 
which shows the time-independent weight of 
individual events in the likelihood analysis during 
the IC86b data period. 

The Gaussian time window is centered at 13
December 2014 [modified Julian day (MJD) 57004]
with an uncertainty of ±21 days and a duration
$T_W = 110^{+35}_{-24}$ days. The best-fitting parameters for
the fluence $J_{100} = \int \Phi_{100}(t)\,dt$ and the spectral
index are given by $E^2 J_{100} = 2.1^{+0.9}_{-0.7} \times 10^{-4}$ TeV cm$^{-2}$
at 100 TeV and $\gamma = 2.1 \pm 0.2$, respectively. The
joint uncertainty on these parameters is shown 
in Fig. 3 along with a skymap showing the result 
of the time-dependent analysis performed at the 
location of TXS 0506+056 and in its vicinity 
during the IC86b data period. 

The box-shaped time window is centered
13 days later with duration $T_W = 158$ days (from
MJD 56937.81 to MJD 57096.21, inclusive of
contributing events at boundary times). For the
box-shaped time window, the uncertainties are
discontinuous and not well defined, but the un-
certainties for the Gaussian window show that it
is consistent with the box-shaped time window
fit. Despite the different window shapes, which
lead to different weightings of the events as a
function of time, both windows identify the same
time interval as significant. For the box-shaped
time window, the best-fitting parameters are sim-
ilar to those of the Gaussian window, with fluence
at 100 TeV and spectral index given by
$E^2 J_{100} = 2.2^{+1.0}_{-0.8} \times 10^{-4}$ TeV cm$^{-2}$ and $\gamma = 2.2 \pm 0.2$. This
fluence corresponds to an average flux over
158 days of $\Phi_{100} = 1.6^{+0.7}_{-0.6} \times 10^{-15}$ TeV$^{-1}$ cm$^{-2}$ s$^{-1}$.
When we estimate the significance of the time-
dependent result by performing the analysis at
the coordinates of TXS 0506+056 on randomized
datasets, we allow in each trial a new fit for all
the parameters: $\Phi_{100}$, $\gamma$, $T_0$, $T_W$. We find that the
fraction of randomized trials that result in a more
significant excess than the real data is $7 \times 10^{-5}$ for
the box-shaped time window and $3 \times 10^{-5}$ for the
Gaussian time window. This fraction, once cor-
rected for the ratio of the total observation time
to the IC86b observation time (9.5 years/3 years),
results in P values of $2 \times 10^{-4}$ and $10^{-4}$, respec-
tively, corresponding to 3.5σ and 3.7σ. Because
there is no a priori reason to prefer one of the
generic time windows over the other, we take the
more significant one and include a trial factor of
2 for the final significance, which is then 3.5σ.
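
The chain of trial corrections described above can be followed numerically. The sketch below applies the period ratio to the two raw trial fractions, then applies the factor of 2 for the choice of window shape, and converts each P value to a one-sided Gaussian significance. The function and variable names are illustrative, the one-sided convention is an assumption that matches the quoted numbers, and the final step starts from the rounded P value of $10^{-4}$ as quoted in the text.

```python
from statistics import NormalDist


def one_sided_sigma(p: float) -> float:
    """Gaussian significance (in sigma) for a one-sided P value."""
    return NormalDist().inv_cdf(1.0 - p)


PERIOD_FACTOR = 9.5 / 3.0  # total observation time / IC86b observation time

p_box = 7e-5 * PERIOD_FACTOR    # box-shaped window, period-corrected
p_gauss = 3e-5 * PERIOD_FACTOR  # Gaussian window, period-corrected

sigma_box = one_sided_sigma(p_box)
sigma_gauss = one_sided_sigma(p_gauss)

# Final result: more significant window (quoted as 1e-4) times a trial factor 2.
p_final = 2 * 1e-4
sigma_final = one_sided_sigma(p_final)

print(f"box:      P = {p_box:.1e}, {sigma_box:.2f} sigma")
print(f"Gaussian: P = {p_gauss:.1e}, {sigma_gauss:.2f} sigma")
print(f"final:    P = {p_final:.1e}, {sigma_final:.2f} sigma")
```

Carrying the unrounded Gaussian P value through the factor of 2 instead gives a marginally smaller final P value, so the quoted 3.5σ reflects the rounding used in the text.
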
Outside the 2012-2015 time period, the next
most significant excess is found using the Gauss-
ian window in 2017 and includes the IceCube-
170922A event. This time window is centered
at 22 September 2017 with duration $T_W = 19$ days,
$\gamma = 1.7 \pm 0.6$, and fluence $E^2 J_{100} = 0.2^{+0.4}_{-0.2} \times 10^{-4}$
TeV cm$^{-2}$ at 100 TeV. No other event besides the
IceCube-170922A event contributes significantly
to the best fit. As a consequence, the uncertainty
on the best-fitting window location and width
spans the entire IC86c period, because any win-
dow containing IceCube-170922A yields a similar
value of the test statistic. Following the trial cor-
rection procedure for different observation periods
as described above, the significance of this excess
is 1.4σ. If the IceCube-170922A event is removed,
no excess remains during this time period. This
agrees with the result of the rapid-response anal-
ysis (31) that is part of the IceCube alert program,
which found no other potential astrophysical
neutrinos from the same region of the sky during
±7 days centered on the time of IceCube-170922A.

fitting time windows (durations $T_W$) over all times $T_0$, with the height
indicating the significance of that window. In each period, the most
significant time window forms a plateau, shaded in blue. The large blue
band centered near 2015 represents the best-fitting 158-day time window
found using the box-shaped time profile. The vertical dotted line in IC86c
indicates the time of the IceCube-170922A event.

We performed a time-integrated analysis at 
the coordinates of TXS 0506+056 using the full 
9.5-year data sample. The best-fitting parameters 
for the flux normalization and the spectral index
are $\Phi_{100} = 0.8^{+0.5}_{-0.4} \times 10^{-16}$ TeV$^{-1}$ cm$^{-2}$ s$^{-1}$ and
$\gamma = 2.0 \pm 0.3$, respectively. The joint uncertainty on
these parameters is shown in Fig. 4A. The P value,
based on repeating the analysis at the same co-
ordinates with randomized datasets, is 0.002%
(4.1σ), but this is an a posteriori significance
estimate because it includes the IceCube-170922A
event, which motivated performing the analysis at
the coordinates of TXS 0506+056. An unbiased
significance estimate including the event would
need to take into account the look-elsewhere effect 
related to all other possible directions in the sky 
that could be analyzed. It is expected that there 
will be two or three directions somewhere in the 
northern sky with this significance or greater, 
resulting from the chance alignment of neutri- 
nos (12). Here, we are interested in determining 
whether there is evidence of time-integrated neu- 
trino emission from TXS 0506+056 besides the 
IceCube-170922A event. 

If we remove the final data period IC86c, which 
contains the event, and perform the analysis 
again using only the first 7 years of data, we find 
best-fitting parameters that are nearly unchanged:
$\Phi_{100} = 0.9^{+0.6}_{-0.5} \times 10^{-16}$ TeV$^{-1}$ cm$^{-2}$ s$^{-1}$ and
$\gamma = 2.1 \pm 0.3$, respectively. The joint uncertainty on these
parameters is shown in Fig. 4B. The P value, using
only the first 7 years of data, is 1.6% (2.1σ), based
on repeating the analysis at the same coordinates

Fig. 2. Time-independent weight of individual events during the IC86b period. Each vertical line
represents an event observed at the time indicated by calendar year (top) or MJD (bottom). 
Overlapping lines are shifted by 1 to 2 days for visibility. The height of each line indicates the event 
weight: the product of the event's spatial term and energy term in the unbinned likelihood analysis 
evaluated at the location of TXS 0506+056 and assuming the best-fitting spectral index $\gamma = 2.1$
(30). The color for each event indicates an approximate value in units of TeV of the reconstructed muon
energy (muon energy proxy), which the analysis compares with expected muon energy distributions
under different hypotheses. [A distribution for the true neutrino energy of a single event can also
be inferred from the event's muon energy (30).] The dashed curve and the solid bracket indicate the
best-fitting Gaussian and box-shaped time windows, respectively. The distribution of event weights 
and times outside of the best-fitting time windows is compatible with background. 


Fig. 3. Time-dependent analysis results for the IC86b data
period (2012-2015).
(A) Change in test statistic, ΔTS, as a function of the spectral
index parameter $\gamma$ and the fluence at 100 TeV given by $E^2 J_{100}$. The
analysis is performed at the coordinates of TXS 0506+056,
using the Gaussian-shaped time window and holding the time
parameters fixed ($T_0$ = 13 December 2014, $T_W$ = 110 days).
The white dot indicates the best-fitting values. The contours at
68% and 95% confidence level assuming Wilks’ theorem (36) are
shown in order to indicate the statistical uncertainty on the parameter
estimates. Systematic uncertainties are not included. (B) Skymap showing
the P value of the time-dependent analysis performed at the coordinates of
TXS 0506+056 (cross) and at surrounding locations. The analysis is

with randomized datasets. These results indicate
that the time-integrated fit is dominated by the
same excess as found in the time-dependent
analysis above, having similar values for the
spectral index and total fluence ($E^2 J_{100} = 2.0 \times 10^{-4}$
TeV cm$^{-2}$ at 100 TeV over the 7-year period).
This excess is not significant in the time-integrated
analysis because of the additional background
during the rest of the 7-year period.
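
The total fluence quoted for the 7-year fit can be checked by multiplying the time-integrated flux normalization by the observation time. The worked line below assumes a livetime of exactly 7 years ($T \approx 2.2 \times 10^{8}$ s) and uses the central value of the 7-year flux normalization, $\Phi_{100} \approx 0.9 \times 10^{-16}$ TeV$^{-1}$ cm$^{-2}$ s$^{-1}$; both simplifications are assumptions of this check.

```latex
E^2 J_{100} \approx E^2\,\Phi_{100}\,T
  = (100~\mathrm{TeV})^2
    \times 0.9\times10^{-16}~\mathrm{TeV^{-1}\,cm^{-2}\,s^{-1}}
    \times 2.2\times10^{8}~\mathrm{s}
  \approx 2.0\times10^{-4}~\mathrm{TeV\,cm^{-2}}
```

This matches the fluence found for the 2014-2015 excess, consistent with the statement that the time-integrated fit is dominated by that excess.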


Blazars as neutrino sources 


The signal identified during the 5-month period
in 2014-2015 consists of an estimated 13 ± 5
muon-neutrino events that are present in addi-
tion to the expected background. The analysis is
unbinned, but the mean background at the dec-
lination of TXS 0506+056 is useful for compar-
ison purposes; it is 5.8 events in a search bin of
radius 1° during a 158-day time window. (We use
the duration of the box-shaped time window re-
sult for convenience to calculate averages during
the flare.) The significance of the excess is due to
both the number of events and their energy
distribution, with higher-energy events increasing
the significance and leading to the best-fitting
spectral index of 2.1, in contrast to the lower-
energy atmospheric neutrino background with
spectral index ~3.7. At this declination in the sky,
the 68% central energy range in which IceCube is
most sensitive to point sources with $E^{-2.1}$ spectra
is between 32 TeV and 3.6 PeV. Assuming that
the muon-neutrino fluence ($E^2 J_{100} = 2.1^{+1.0}_{-0.7} \times 10^{-4}$
TeV cm$^{-2}$) is one-third of the total neu-
trino fluence, then the all-flavor neutrino energy
fluence is $4.2^{+2.0}_{-1.4} \times 10^{-3}$ erg cm$^{-2}$ over this
energy range. With the recent measurement (32)
of the redshift of TXS 0506+056 as $z = 0.3365 \pm
0.0010$, this energy fluence implies that the iso-
tropic neutrino luminosity is $1.2^{+0.6}_{-0.4} \times 10^{47}$ erg s$^{-1}$
averaged over 158 days. This is higher than the
isotropic gamma-ray luminosity during the same
period, which is similar to the long-term luminosity
between 0.1 GeV and 100 GeV of $0.28 \times 10^{47}$ erg
s$^{-1}$ averaged over all Fermi-LAT observations of
TXS 0506+056 (20). Gamma rays are expected to
be produced in the same processes that produce
neutrinos, for example when accelerated pro-
tons interact with ambient lower-energy photons
near the source, producing both neutral pions
(which decay to gamma rays) and charged
pions (which decay to neutrinos and leptons). A
higher luminosity in neutrinos than in gamma
rays could imply that a substantial fraction of
the gamma rays related to the neutrino produc-
tion are either absorbed or arriving at energies
above or below the Fermi-LAT energy band.

performed on the IC86b data period, using the Gaussian-shaped time window.
At each point, the full fit for ($\Phi$, $\gamma$, $T_0$, $T_W$) is performed. The P value shown
does not include the look-elsewhere effect related to other data periods. An
excess of events is detected, consistent with the position of TXS 0506+056.

Fig. 4. Time-integrated analysis results. As in Fig. 3A, but for
the time-integrated analysis of TXS 0506+056 using (A) the full
9.5-year sample (2008-2017), and (B) the 7-year sample
(2008-2015).
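
The all-flavor energy fluence quoted in this section follows from integrating the fitted power law over the stated energy range. The sketch below carries out that integral analytically for $J(E) = J_{100}\,(E / 100\,\mathrm{TeV})^{-\gamma}$, multiplies by three for the assumed equal flavor mix, and converts TeV to erg. The function name and the use of central values only (no uncertainties) are assumptions of this sketch.

```python
import math

TEV_TO_ERG = 1.602176634  # 1 TeV expressed in erg


def energy_fluence_tev(e2j100, gamma, e_min, e_max, e_pivot=100.0):
    """Integral of E * J(E) dE in TeV cm^-2, with energies given in TeV.

    J(E) = J100 * (E / e_pivot)**(-gamma), where J100 = e2j100 / e_pivot**2.
    """
    j100 = e2j100 / e_pivot ** 2
    if abs(gamma - 2.0) < 1e-12:
        return j100 * e_pivot ** 2 * math.log(e_max / e_min)
    k = 2.0 - gamma
    return j100 * e_pivot ** gamma * (e_max ** k - e_min ** k) / k


# Central values from the text: E^2 J100 = 2.1e-4 TeV cm^-2, gamma = 2.1,
# integrated between 32 TeV and 3.6 PeV (3600 TeV).
muon_fluence = energy_fluence_tev(2.1e-4, 2.1, 32.0, 3600.0)
all_flavor_erg = 3.0 * muon_fluence * TEV_TO_ERG
print(f"all-flavor energy fluence: {all_flavor_erg:.2e} erg cm^-2")
```

With these central values the result lands close to the quoted $4.2 \times 10^{-3}$ erg cm$^{-2}$; the small difference is within what rounding of the inputs would produce.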

Although TXS 0506+056 is a bright object in 
gamma rays, it was not previously singled out as 
a predicted neutrino source. In the third catalog 
of active galactic nuclei detected by Fermi-LAT 
(33) listing 1773 objects (including those at low 
galactic latitudes), TXS 0506+056 is among the 
50 brightest objects, with an average flux be-
tween 1 GeV and 100 GeV of $6.5\,(\pm 0.2) \times 10^{-9}$
photons cm$^{-2}$ s$^{-1}$. Its measured redshift now
makes it one of the most luminous objects known 
out to the same distance, more than an order of 
magnitude more luminous than nearby blazars 
such as Markarian 421, Markarian 501, and 1ES 
1959+650. With respect to these objects, an 
important observational distinction is the fa- 
vorable declination of TXS 0506+056. As the 
neutrino-nucleon interaction cross section grows 
with energy, absorption in Earth becomes con- 
siderable for neutrinos above ~100 TeV. IceCube 
is most sensitive to high-energy neutrinos 
from sources at declinations near the equatorial 
plane, which is viewed along the horizon from 
the South Pole. The blazars mentioned above 
are at more northern declinations, and the like- 
lihood that a neutrino with energy of ~300 TeV 
from one of these will be absorbed while travers- 
ing Earth is three to five times the likelihood that 
it will reach the detector. The explanation for 
why TXS 0506+056 is the first blazar associated 
with a significant neutrino excess may there- 
fore depend on the combination of its intrinsic 
properties and the observational properties of 
the detector. 

IceCube recently published (34) a search for 
neutrino emission from the blazars in the second 
catalog of active galactic nuclei detected by Fermi- 
LAT (35), constraining their contribution to the 
diffuse astrophysical neutrino flux under different
model assumptions. An upper limit of 27% was
found assuming the diffuse flux that is fit between
10 TeV and 100 TeV with a soft $E^{-2.5}$ spectrum
(16). For an $E^{-2}$ spectrum compatible with the
diffuse flux fit above ~200 TeV (17), the upper 
limit is between 40% and 80%. The allowed con- 
tribution by blazars as a population is larger, be- 
cause it would include the contribution of fainter 
and more distant blazars not yet resolved in the 
catalog. Averaged over 9.5 years, the neutrino flux 
of TXS 0506+056 by itself corresponds to 1% of 
the astrophysical diffuse flux and is fully com- 
patible with the previous blazar catalog results. 

The evidence presented above supports the 
hypothesis presented in (20) that the blazar TXS 
0506+056 is a high-energy neutrino source. The 
3.5σ evidence for neutrino emission during the
5-month period in 2014-2015 is statistically in- 
dependent of the evidence presented in (20). 
The analysis of the IceCube-170922A event in 
(20) relies on correlation of a single neutrino with 
electromagnetic activity, whereas the analysis 
presented here relies only on self-correlation 
of multiple neutrinos. The coincidence of an 
IceCube alert with a flaring blazar, combined 
with a neutrino flare from the same object in 
archival IceCube data, pinpoints a likely source 
of high-energy cosmic rays.

FERROELECTRICITY


Metal-free three-dimensional 
perovskite ferroelectrics 


Heng-Yun Ye, Yuan-Yuan Tang, Peng-Fei Li, Wei-Qiang Liao, Ji-Xing Gao,
Xiu-Ni Hua, Hu Cai, Ping-Ping Shi, Yu-Meng You, Ren-Gen Xiong


Inorganic perovskite ferroelectrics are widely used in nonvolatile memory elements, 
capacitors, and sensors because of their excellent ferroelectric and other properties. 
Organic ferroelectrics are desirable for their mechanical flexibility, low weight, 
environmentally friendly processing, and low processing temperatures. Although almost 
a century has passed since the first ferroelectric, Rochelle salt, was discovered, examples 
of highly desirable organic perovskite ferroelectrics are lacking. We found a family of 
metal-free organic perovskite ferroelectrics with the characteristic three-dimensional 
structure, among which MDABCO (N-methyl-N'-diazabicyclo[2.2.2]octonium)-ammonium
triiodide has a spontaneous polarization of 22 microcoulombs per square centimeter 
[close to that of barium titanate (BTO)], a high phase transition temperature of 448 kelvins 
(above that of BTO), and eight possible polarization directions. These attributes make it 
attractive for use in flexible devices, soft robotics, biomedical devices, and other applications. 


The word “perovskite” is derived from the
mineral CaTiO3, which was first discov-
ered by Gustav Rose in 1839 (1). Magnesium
silicate perovskite and calcium silicate pe-
rovskite are also the most abundant min-
erals in Earth’s interior (2). Typical perovskites
have a distinct three-dimensional (3D) ABX3
crystal structure, in which the A cations are lo-
cated in the 3D corner-sharing framework of
BX6 octahedrons (B represents the other cation
and X is an anion) (3). There are hundreds of 
known perovskites, constituting an essential 
class of functional materials in optoelectronics 
and microelectronics. Among their numerous 
attractive properties, ferroelectricity, which is 
the ability to switch spontaneous polarization 
($P_s$) under an applied electric field, is of partic-
ular interest for theoretical studies and is im-
portant for a variety of applications (4-7).
Inorganic perovskites, such as BaTiO3 (BTO)
and Pb(Zr,Ti)O3, dominate applications such as
ferroelectric memories, piezoelectric sensors, ac-
tuators, capacitors, and nonlinear optical devices
(8-10). However, the requirements for practical
materials to be energy efficient, economically in-
expensive, and environmentally friendly (“triple
E”) motivate the exploration for nontoxic, low-
cost, and simple alternatives to inorganic pe-
rovskites (11). Organic compounds in general
and hybrid organic-inorganic perovskites in 
particular are attractive alternatives. These ma-
terials are lightweight, low cost, mechanically
flexible, structurally tunable, and easy to pro- 
cess. These properties make organics attractive 
for materials science and technology applica- 
tions (12, 13). 

Several different hybrid halide organic-inorganic 
perovskites have notable properties. Hybrid halide 
perovskites such as (CH3NH3)PbI3 and (CH3NH3)
PbBr3 have attracted much attention in the solar
cell and light-emitting diode (LED) fields because 
of their outstanding photovoltaic and lumines- 
cent performance (14-19). A few organic-inorganic 
perovskite ferroelectrics containing halides, cy- 
anides, and formates were developed as com- 
petitive candidates for microelectromechanical 
systems (MEMS) (20-25). We recently synthe- 
sized trimethylchloromethyl ammonium trichloro- 
manganese(II), which has a large piezoelectric 
response and displays ferroelectric properties 
comparable to those of BTO (13).

An underexplored but intriguing area for de- 
veloping ferroelectric materials is metal-free, or 
organic, perovskites. These materials have an 
organic cation occupying the B site of the crys- 
tal. The organic composition should allow for 
wide chemical diversity, structural flexibility, and 
additional functionalities that are difficult to 
achieve with inorganic perovskites. The concerns 
with organic perovskites are that the absence 
of the metal ion will affect the stability of the 3D 
perovskites and they may not be good ferroelectric
materials. In 2002, a molecular perovskite
hydrate, (C4H12N2)(NH4Cl3)·H2O, featuring the
piperazinium cation (C4H12N2)²⁺ and water molecules
enclosing the 3D corner-sharing [(NH4)Cl6]
network, was described (26). The discovery
marked the beginning for metal-free perovskites,
although this perovskite has an ABX3C
rather than a real ABX3 structure (26, 27). Such
a system is promising for constructing noncentro- 
symmetric or polar structures and homochiral 
analogs. However, 15 years after the initial discovery
of metal-free perovskites, no observations
of ferroelectricity or other functionalities have
been reported.

We used a molecular design strategy (see supplementary
materials) (28) and careful selection
of the organic cations to develop a family of
metal-free ABX3-type 3D perovskite ferroelectrics,
with the general formula A(NH4)X3
(where A is a divalent organic cation and X is
Cl, Br, or I, as listed in Fig. 1A). We synthesized
a total of 23 different members of this family
with a range of different structures (tables S1
to S9) and phase transition temperatures (table
S10). Among the first A(NH4)X3 perovskites we
discovered, MDABCO-NH4I3 (MDABCO is
N-methyl-N′-diazabicyclo[2.2.2]octonium) has a
high phase transition temperature (T0) of 448 K
and a large spontaneous polarization (Ps) of
22 μC/cm², which are comparable
to those of BTO (table S11). Furthermore,
using piezoresponse force microscopy (PFM), 
we discovered the coexistence of various ferro- 
electric domains with eight polarization direc- 
tions and evidence for the flexible rotation of 
polarization directions by the application of an 
electric field. Our strategy demonstrates the via- 
bility of high-performance metal-free ferroelec- 
tric perovskites. Without the metal element, such 
organics are expected to be on par with their 
inorganic and organic-inorganic counterparts, 
enhanced by the high flexibility, tunable structure- 
property relationships, possible solution and vacu- 
um processes, and biocompatibility of organic 
materials. 


General structures 


Our metal-free 3D perovskite ferroelectrics are 
all the classical ABX3 type without water, with 
3D networks of corner-shared (NH4)X6 (X is Cl,
Br, or I) octahedra that enclose cavities occupied
by the divalent organic ammonium cations, and
all components are held together by ionic and
hydrogen-bonding interactions (Fig. 1 and figs.
S1 to S18). We collected x-ray diffraction data and
determined the crystal data for the structures 
(tables S4 to S9). Most of these metal-free pe- 
rovskite structures crystallize in the ferroelectric 
space group at room temperature. We summa- 
rize the space groups of their ferroelectric and 
paraelectric phases and the Curie temperatures 
in table S10. 

We evaluated ionic size mismatches in this
large class of metal-free perovskite compounds
by calculating the geometrical parameter
α = (R_A + R_X)/[√2(R_B + R_X)]. This is the tolerance
factor introduced by V. M. Goldschmidt in 1926
(29). In this definition, R_i is the radius
of ion i in the perovskite ABX3, where i is A,
B, or X. We obtained the radii
R_A from experimental single-crystal x-ray diffraction.
All of the tolerance factors for the presented
divalent cations in the NH4X3 cavities range
between 0.87 and 1.00 (fig. S19 and table S12). For
values of α in the range of 0.9 to 1.0, we found 3D
perovskites with greater probability, consistent
with the previous analysis (30). Values of 0.87
to 0.89 lead to 1D hexagonal structures. A stable
3D perovskite structure does not depend solely
on ionic size mismatching. It also requires intermolecular
interactions, such as van der Waals
forces and hydrogen bonding, to become predominant
in the pure organic perovskite materials.
The 3D perovskite structure of DABCO-NH4Br3
(DABCO is N-N′-diazabicyclo[2.2.2]octonium)
with α = 0.88 is a good demonstration of the
comprehensive and sophisticated factors involved
in the stable formation of the 3D perovskite
structure.
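
As a minimal sketch of the tolerance-factor calculation described above, the following Python snippet evaluates α = (R_A + R_X)/[√2(R_B + R_X)] and maps it onto the ranges quoted in the text. The radii used here are hypothetical placeholders chosen only to exercise the formula; they are not the values of table S12, and the classification is a rough guide rather than a rule (DABCO-NH4Br3 with α = 0.88 is a stated exception).

```python
import math


def tolerance_factor(r_a: float, r_b: float, r_x: float) -> float:
    """Goldschmidt tolerance factor: (R_A + R_X) / (sqrt(2) * (R_B + R_X))."""
    return (r_a + r_x) / (math.sqrt(2) * (r_b + r_x))


def expected_dimensionality(alpha: float) -> str:
    """Rough guide based on the ranges quoted in the text, not a strict rule."""
    if 0.9 <= alpha <= 1.0:
        return "3D perovskite more likely"
    if 0.87 <= alpha < 0.9:
        return "1D hexagonal structure more likely"
    return "outside the range discussed"


# Hypothetical radii in angstroms (assumptions for illustration only).
R_A = 2.90  # assumed effective radius of a divalent organic cation
R_B = 1.46  # assumed effective radius of NH4+
R_X = 2.20  # assumed radius of I-

alpha = tolerance_factor(R_A, R_B, R_X)
print(f"alpha = {alpha:.3f}: {expected_dimensionality(alpha)}")
```

Because the organic cations are not spherical, any single effective radius is itself an approximation, which is one reason the factor can only indicate a tendency.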


MDABCO-NH4I3
Ferroelectric phase 


MDABCO-NH4I3 has a relatively simple 3D structure
(a unit cell contains just one molecular unit)
(fig. S1), and thus we take it here as an example to
elucidate the ferroelectric mechanism. At room
temperature, MDABCO-NH4I3 crystallizes in the
polar space group R3, Z = 1 (where Z is the number
of formula units), with unit cell dimensions
a = b = c = 7.259(4) Å and α = β = γ = 84.767(4)°
(where numbers in parentheses indicate uncertainty).
In the cage-like unit cell, the MDABCO
cation is located at the center, NH4⁺ groups are located
at the vertices, and I⁻ ions are at the centers of the
edges. According to the Curie symmetry superposition
principle, the symmetry group of the
specific assembly mode of the MDABCO-NH4I3
crystal adopts the maximal common subgroup
of the symmetry groups of its constituent molecules.
In the crystal of MDABCO-NH4I3, MDABCO²⁺
and NH4⁺ are arranged alternately along the
crystallographic [1 1 1] direction (Fig. 1B), and
the intramolecular C3 axes of both molecules
are superimposed on the crystallographic C3
axis. As the symmetry groups of the MDABCO²⁺
and NH4⁺ molecules are C3 and Td, respectively,
the maximal common subgroup will be the C3
point group. During the molecular assembly, the
symmetry elements 4̄ and m of point group
Td disappear, and the symmetry elements 3
of both point groups Td and C3 are preserved. The
resultant polar point group symmetry C3 should
lead to the generation of spontaneous polarization
along the [1 1 1] direction, because the MDABCO²⁺
molecule carries a dipole moment along the
molecular threefold axis. The polar point group
in such metal-free frameworks is not expected
when we replace MDABCO²⁺ with DABCO²⁺, because
of the D3h symmetry of DABCO²⁺. In
the paraelectric phase, by contrast, the disordered MDABCO²⁺
molecule can be regarded as a special spherical
configuration with O point group symmetry
composed of all eight orientations possible in
the ferroelectric phase. Such dynamically disordered
behavior of the MDABCO²⁺ molecule is
responsible for the 432 symmetry of its paraelectric
phase.


Paraelectric phase 


At T0 = 448 K, MDABCO-NH4I3 transforms into
the cubic paraelectric phase, which features the
space group P432, a = 7.516(13) Å, Z = 1. In the
paraelectric phase, MDABCO becomes disordered
and the (NH4)I6 octahedron becomes regular
(fig. S1B). MDABCO is located on the special
site of 432 symmetry. The high-symmetry site of
432 requires total disorder of the organic cation,
and accordingly, we modeled MDABCO with a
spherical structure discarding its chemical sense
(Fig. 1C). This suggests that the MDABCO cations
are nearly freely rotating in the high-temperature
paraelectric phase. Dynamic disorder of organic
molecules in crystals is usually observed
for those molecules with spherical geometries,
such as tetramethylammonium, adamantane,
and DABCO. Compared with that of DABCO, the
shape of MDABCO deviates just slightly from
the spherical geometry (Fig. 1B, blue oval). Such
a molecular geometry is the key element for
the generation of ferroelectricity. On the one
hand, the quasispherical geometry makes molecular
reorientation easy, which is necessary
for ferroelectric polarization reversal (9). On the
other hand, the lowering of molecular symmetry
through the attachment of a methyl group to
DABCO leads to the generation of dipole moments,
whose alignment results in the symmetry
breaking and contributes to the polarization of
the crystal. According to the symmetry change,
MDABCO-NH4I3 belongs to the ferroelectric species
of 432F3 (31).

Fig. 1. Chemical and crystal structures of the metal-free A(NH4)X3 family. (A) Chemical structures
of constituents of the metal-free 3D perovskite ferroelectrics. (B) The packing diagram of
MDABCO-NH4I3 in the ferroelectric phase at 293 K. The oval to the right contains the space-fill diagram of the
organic cation, showing the cationic geometry to be close to a ball. (C) The packing diagram of
MDABCO-NH4I3 in the paraelectric phase at 463 K.

Because of the high symmetry of DABCO, the
DABCO-containing molecular analogs, such as
DABCO-(NH4)Br3 (fig. S13) and DABCO-RbCl3
(27), do not have ferroelectricity. Additionally,
the suitable size of the MDABCO ion allows it to
template the 3D perovskite structure. Meanwhile,
we also isolated hexagonal organic perovskites
DABCO-(NH4)X3 (where X is Cl or I) and found
that these materials do not show spontaneous
polarization (figs. S14 and S15).


General characterizations 


A pair of reversible thermal anomalies in the differential
scanning calorimetry (DSC) curves reveals
a phase transition around 448 K for MDABCO-NH4I3
(Fig. 2A). This T0 is higher than those of
the vast majority of high-temperature molecular
ferroelectrics and BTO (393 K). We estimated
the entropy change (ΔS) accompanying the phase
transition to be ~50 J mol⁻¹ K⁻¹. The main contribution
to the high ΔS should be from the
transition of the MDABCO cation from a freely
rotating state to a static state. Meanwhile, we
also confirmed the phase transition by the
temperature-dependent second-harmonic generation
(SHG) response (fig. S20). The detectable
SHG intensity vanishes for temperatures above
T0, supporting the notion of a phase transition
from the polar 3 point group into a nonpolar
432 one, which should have null SHG response
under the restriction of Kleinman symmetry (32).
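
To give a feel for the size of this entropy change, the sketch below applies the Boltzmann relation ΔS = R ln N, where N is the ratio of accessible states above and below the transition. That relation is not invoked in the text and is used here as an assumption; the function names are illustrative only.

```python
import math

R_GAS = 8.314462618  # molar gas constant, J mol^-1 K^-1


def state_ratio(delta_s: float) -> float:
    """Ratio of accessible states N implied by delta_s = R ln N (assumed model)."""
    return math.exp(delta_s / R_GAS)


def entropy_from_ratio(n: float) -> float:
    """Entropy change in J mol^-1 K^-1 for a given ratio of states N."""
    return R_GAS * math.log(n)


DELTA_S = 50.0  # J mol^-1 K^-1, the estimate quoted in the text

n = state_ratio(DELTA_S)
print(f"N implied by the measured entropy change: {n:.0f}")

# For comparison: a cation hopping between eight discrete orientations only.
print(f"R ln 8 = {entropy_from_ratio(8):.1f} J mol^-1 K^-1")
```

Under this assumption the measured ΔS corresponds to a state ratio of a few hundred, whereas ordering among only eight discrete orientations would account for roughly a third of it, which is in line with the picture of a nearly freely rotating cation above T0.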

The real part (ε′) of the complex dielectric constant
(ε = ε′ − iε″, where i is the imaginary unit
and ε″ is the imaginary part of ε) as a function
of temperature at various frequencies shows a
prominent λ-shape peak anomaly around T0
along the [111] direction (Fig. 2B), indicative of
a proper ferroelectric transition. The peak value
increases with decreasing frequency and reaches




Fig. 2. Ferroelectric properties of MDABCO-NH4I3. (A) The results of DSC measurements
(heating and cooling curves), revealing a structural phase transition at around T0 = 448 K.
(B) The temperature dependence of the real part (ε′) of the complex dielectric constant at
selected frequencies. k indicates thousands, and M represents a million. (C) P-E hysteresis
loops measured at various temperatures. (D) Spontaneous polarization Ps and second-order
nonlinear optical coefficient χ(2) as a function of temperature. a.u., arbitrary units.


a large value of about 14,068 at 20 Hz. The dielectric
response obeys the Curie-Weiss law well;
that is, ε = C/(T − T0), where ε is the dielectric
constant, C is the Curie constant, T (T > T0) is the
temperature, and T0 is the Curie-Weiss temperature.
The reciprocal value of ε′ at 20 Hz as a
function of temperature is linear, with the fitted
Curie-Weiss constant C of 14,068 K, which is further
evidence for a proper ferroelectric phase
transition (fig. S21).
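
The Curie-Weiss analysis can be sketched as a straight-line fit of 1/ε′ against T: since 1/ε = (T − T0)/C, the slope gives 1/C and the intercept gives −T0/C. The data below are synthetic, generated from the quoted C = 14,068 K and an assumed T0 of 448 K; they are not the measured curve of fig. S21, and the helper names are illustrative.

```python
def linear_fit(xs, ys):
    """Ordinary least-squares line y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def curie_weiss_fit(temps, eps):
    """Fit 1/eps = (T - T0)/C and return (C, T0)."""
    slope, intercept = linear_fit(temps, [1.0 / e for e in eps])
    return 1.0 / slope, -intercept / slope


# Synthetic paraelectric-side data (assumed values, for illustration only).
C_TRUE = 14068.0  # K, Curie constant quoted in the text
T0_TRUE = 448.0   # K, assumed Curie-Weiss temperature
temps = [452.0 + 2.0 * k for k in range(10)]
eps = [C_TRUE / (t - T0_TRUE) for t in temps]

c_fit, t0_fit = curie_weiss_fit(temps, eps)
print(f"C = {c_fit:.0f} K, T0 = {t0_fit:.1f} K")
```

With real data, only points above the transition should enter the fit, and for a first-order transition the fitted Curie-Weiss temperature need not coincide exactly with the transition temperature.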

We used the Berlincourt method (13) to measure
the piezoelectric coefficient d33 on the single
crystal of MDABCO-NH4I3. At room temperature,
we obtained a maximum d33 of ~14 pC/N
along the [111] direction of the crystal (fig. S22). 
To investigate the optoelectronic potential, we 
also examined the optical absorption and emis- 
sion properties by ultraviolet-visible absorption 
and photoluminescence spectroscopies, which 
showed that most of the perovskites tested are 
wide-band gap insulators and only those contain- 
ing iodide can exhibit photoluminescence with 
the maximum emission wavelength of 561 nm 
(figs. S23 to S25).


Ferroelectricity 


We further confirmed the ferroelectricity of
MDABCO-NH4I3 by the measurement of
polarization-electric field (P-E) hysteresis loops
on the (1 1 1) plane recorded with the Sawyer-Tower
circuit method (33) (Fig. 2C). We estimated
the Ps value at 373 K as 19 μC/cm². The
coercive field Ec is in the range of 6 to 12 kV/cm,
comparable to that of BTO (10 kV/cm) and much
smaller than that of polyvinylidene difluoride
(PVDF) (500 kV/cm) (34). We also used pyroelectric
measurements to evaluate the spontaneous
polarization Ps. Spontaneous polarization
appears below T0 (Fig. 2D) and then gradually
increases to a saturated value of 22 μC/cm², which
is close to the values of 23 μC/cm² for diisopropylammonium
bromide and 26 μC/cm² for BTO and
greater than the 8 μC/cm² for PVDF and those
for other organic-inorganic perovskite molecular
ferroelectrics (35). The variation of Ps has
the same trend as the effective second-order nonlinear
optical susceptibility χ(2), in good agreement
with the Landau relationship χ(2) = 6ε0βPs, where β
represents a coefficient almost independent of
the temperature (36). The variations of both Ps
and χ(2) in the vicinity of T0 are abrupt, revealing
a typical first-order phase transition.


Spontaneous polarization 


The mechanism of ferroelectric spontaneous polarization
in MDABCO-NH4I3 has distinctive characteristics. Because
of the ABX3-type perovskite structure and metal-free
composition, the origin of polarization in
MDABCO-NH4I3 is more complicated than that
in conventional inorganic perovskites, such as
BTO, and organic-inorganic hybrid perovskites,
such as (CH3NH3)PbI3.

In the classic ABO3-type inorganic perovskite,
such as BTO, spontaneous polarization is induced
by the off-centering displacement (OCD)
of the small cation at the B site. Such a phase
transition is mainly displacive (37), with some
contributions from order-disorder characteristics
(38). Similar to the B-site cation in BTO,
the B-site NH4⁺ ion in MDABCO-NH4I3 undergoes
a 0.2734-Å displacement from the center of
the six surrounding I⁻ ions during the phase
transition because of the octahedral tilting near
T0. The OCD of the B-site NH4⁺ ion is determined
to be along the [111] direction. Besides the OCD of the
B-site NH4⁺ ion, the charge transfer through the
N_NH4-H···I hydrogen bonds (where N_NH4 indicates
the N atom of NH4⁺ and ··· indicates interaction
between H and I) should also contribute
to the polarization. Those hydrogen-bonding interactions
[d_H-N (H_NH4-N_NH4 distance), 0.9 Å;
d_H···I (H_NH4···I distance), 2.78 Å; ∠NHI, 159.7°;
and d_N···I, 3.637(9) Å] (fig. S26) are formed in the
ferroelectric phase with the ordering of hydrogen
bonds within the (NH4)I6 octahedron. Thus, for
the B site, both charge transfer and OCD contribute
to the ferroelectric polarization.

On the other hand, because of the metal-free 
composition and nature of molecular ferroelectrics, 
different from those of BTO, the A-site MDABCO 
also plays an important role in the generation of 
ferroelectricity. During the paraelectric-ferroelectric 
phase transition, A-site MDABCO changes from 
a nearly free rotation state to a static state. Such 
freezing of the dynamic motion tends to be dis- 
continuous (22, 39-41), and the ordering of the 
MDABCO cation leads not only to the alignment 
of the molecular dipole moment but also to substantial
displacement. For conventional order-disorder-type
inorganic compounds or organic
salts, such as NaNO2, KH2PO4, and triglycine
sulfate (9, 10), the order-disorder transitions are
usually continuous, and the displacements of those
dipoles are small and accordingly contribute little
spontaneous polarization. In MDABCO-NH4I3,
the polarization direction induced by
the displacement is the same as that of the molecular
dipole moment of MDABCO (fig. S27).
The displacement of ~0.65 Å is induced by the
shift of the positive charge carried by MDABCO, 
which is accompanied by the elongation of the 
cubic cell along the body diagonal. In this sense, 
the ferroelectric contribution of the A site is 
essentially the displacive type. 
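
A back-of-the-envelope point-charge estimate helps put the two displacements in perspective. The sketch below computes the rhombohedral cell volume from a = 7.259 Å and α = 84.767° (Z = 1) and then P = q·d/V for each ion. Treating the ions as rigid point charges with formal charges +2 (MDABCO) and +1 (NH4) is an assumption made here for illustration; it is not the authors' DFT treatment.

```python
import math

E_CHARGE = 1.602176634e-19  # elementary charge, C
ANGSTROM = 1.0e-10          # m


def rhombohedral_volume(a: float, alpha_deg: float) -> float:
    """Cell volume in cubic angstroms for a rhombohedral cell (a, alpha)."""
    c = math.cos(math.radians(alpha_deg))
    return a ** 3 * math.sqrt(1.0 - 3.0 * c * c + 2.0 * c ** 3)


def point_charge_polarization(charge_e: float, shift: float, volume: float) -> float:
    """Polarization in uC/cm^2 from a charge (units of e) shifted by `shift` (angstrom)."""
    p_si = charge_e * E_CHARGE * shift * ANGSTROM / (volume * ANGSTROM ** 3)
    return p_si * 100.0  # 1 C/m^2 equals 100 uC/cm^2


volume = rhombohedral_volume(7.259, 84.767)       # lattice values from the text
p_a = point_charge_polarization(2.0, 0.65, volume)    # A-site MDABCO, assumed +2 point charge
p_b = point_charge_polarization(1.0, 0.2734, volume)  # B-site NH4, assumed +1 point charge

print(f"V = {volume:.1f} A^3")
print(f"A-site estimate: {p_a:.2f} uC/cm^2, B-site estimate: {p_b:.2f} uC/cm^2")
```

Under these assumptions the two displacive terms give roughly 5.5 and 1.2 μC/cm², well short of the measured 22 μC/cm². This crude picture ignores the relative motion of the I⁻ sublattice, the molecular dipole alignment, and electronic effects, so the gap should be read only as a hint that the displacements alone are unlikely to account for the full polarization.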

The new class of metal-free perovskite ferroelectrics
has a distinct polarization origin, different
not only from that of conventional inorganic
perovskites but also from that of organic-inorganic
hybrid perovskites, such as (CH3NH3)PbI3. The
order-disorder mechanism has been found to
dominate the phase transitions of those hybrid
perovskites, such as (CH3NH3)PbI3 (42, 43) and
[(CH3)2NH2]M(HCOO)3 (M is Zn, Mn, Fe, Co, or
Ni) (23, 24). In those organic-inorganic hybrid
perovskites, the hydrogen-bonding interactions
between the A-site organic ammoniums and the
X-site anions play an important role. The order-disorder
transitions are accompanied by the formation
and breaking of hydrogen bonds. By
contrast, in MDABCO-NH4I3, the possible hydrogen
bonding between the A-site MDABCO and
the X-site I⁻ ion is through the N_MDABCO-H···I
interactions, which are not suitable to form a
classic hydrogen bond because the distance
d_H···I is too large [d_H···I = 3.2098(10) Å] (44).
Nevertheless, the weak N_MDABCO-H···I interactions
also contribute to the spontaneous polarization
(fig. S27). Because other atoms are
farther away from I⁻ ions, the A-site MDABCO
should be stabilized mostly by the ionic interactions,
as supported by the density functional
theory (DFT) calculations (figs. S28 to S30). The
relatively low coercive electric field (Ec) may be
related to the lack of strong hydrogen-bonding
interactions, along with the quasispherical molecular
geometry and the relatively large cavity
enclosed by the octahedra.

At this stage, one can conclude that the ferroelectricity
in MDABCO-NH4I3 is mainly ionic
and influenced by hydrogen-bonding effects and
that the large spontaneous polarization prob- 
ably results from the parallel contributions of the 
displacement of charges, the alignment of the 
molecular dipole moments, hydrogen bond ef- 
fects, and electronic contributions. 


PFM characterization 


Finally, we used PFM to observe the polarization
ordering and switching dynamics of MDABCO-NH4I3
(45-48). Initial testing of single crystals
(along the [1 1 1] direction) (figs. S31 and S32)
indicates that the organic perovskite is dominated
mainly by a large-area single-domain
state with out-of-plane polarization (fig. S33).
We were able to generate box-in-box bipolar domain
patterns after applying a tip bias of +100 V
in a 10 μm by 10 μm area and then −100 V in a
centered 5 μm by 5 μm square region, demonstrating
that the polarization of the ferroelectric
domain can be switched back and forth in the
MDABCO-NH4I3 crystal (Fig. 3A). In the microcrystalline
state of the material, because of the
existence of stress imposed by the matrix, non-180°
domains with different polarization directions
can appear. These twin domains can form
various patterns, including stripes, which is typical
for materials such as BiFeO3 (Fig. 3B and
fig. S34) and occurs sometimes when multiple
domains intersect one another (Fig. 3C). To
identify different polarization directions and
analyze the angles between them, we performed
comprehensive PFM studies for patterns with
multiple domain intersections (figs. S35 to S38),
the results of which clearly indicate the existence
of eight polarization directions, corresponding
well with the four polar axes deduced
from the 432F3 transition. Most notably, we
also successfully realized the rotation of the
polarization direction in the MDABCO-NH4I3
microcrystal by using an external electric field,
as the out-of-plane amplitude intensities of the
central zone change from weak to strong in
Fig. 3D. The emerging domain shape tends
toward a hexagon, in good agreement with the
growth habit of the trigonal crystal. Furthermore,
a local PFM study also showed the capability for
non-180° polarization rotation (fig. S39). This
means that a larger spontaneous polarization
can be achieved in the corresponding polycrystalline
sample after electric poling, facilitating
the sample’s practical applications in the thin-film
form.

Fig. 3. PFM of MDABCO-NH4I3. (A) Out-of-plane PFM phase image recorded after writing an area
of 10 μm by 10 μm with +100 V and then the central 5 μm by 5 μm square with −100 V by using
a biased conductive tip in the bulk crystal. (B and C) PFM amplitude images in the as-grown
microcrystals. (D) Out-of-plane PFM amplitude image after point polarization switching at the center
point with positive bias at +40 V for 2 s in the microcrystal.
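
The count of eight polarization directions on four polar axes can be illustrated with a short enumeration. The sketch assumes that the polar axes are the body diagonals of the cubic parent cell, consistent with polarization along [1 1 1] and the 432F3 species; the code is illustrative and is not the analysis used for figs. S35 to S38.

```python
import itertools
import math


def body_diagonals():
    """All eight <111>-type directions of a cube as integer triples."""
    return list(itertools.product((1, -1), repeat=3))


def angle_deg(u, v) -> float:
    """Angle between two vectors in degrees, clamped against rounding error."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return math.degrees(math.acos(max(-1.0, min(1.0, dot / norm))))


directions = body_diagonals()

# Each polar axis carries two antiparallel directions; keep one representative.
axes = {max(d, tuple(-c for c in d)) for d in directions}

# Distinct angles between any two different polarization directions.
angles = sorted(
    {round(angle_deg(u, v), 2) for u, v in itertools.combinations(directions, 2)}
)

# Point group 432 has order 24 and point group 3 has order 3: 24 / 3 = 8.
print(len(directions), "directions on", len(axes), "axes; angles:", angles)
```

In the ideal cubic reference frame the non-180° domain walls therefore separate polarizations at about 71° or 109°; the small rhombohedral distortion of the real cell would shift these angles slightly.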


Other metal-free perovskite ferroelectrics 


In addition to MDABCO-NH4I3, by changing the A
cation and X anion according to the Goldschmidt
tolerance factor, we have developed a wide variety
of metal-free ABX3-type 3D perovskite ferroelectrics
(Fig. 4), such as ODABCO-NH4Br3 (ODABCO
is N-hydroxy-N′-diazabicyclo[2.2.2]octonium) with
an Aizu notation of 432F3, ODABCO-NH4Cl3
with 4mFmm2, rac-3AP-NH4Br3 (rac-3AP is
rac-3-ammoniopyrrolidinium) with m3mFm,
S-3AQ-NH4Br3 (S-3AQ is S-3-ammonioquinuclidinium)
with 432F2, and others (table S10). Chemical modifications
of the constituent ions may promote
the performance of the resulting ferroelectrics.
For example, the spontaneous polarization will
be enhanced by the introduction of ionic molecules
with larger dipole moments. Moreover,
using chiral template organic cations, we have
successfully created optically active noncentrosymmetric
organic perovskites. For example, molecular
cationic isomers (R-3AP)²⁺ and (S-3AP)²⁺
and (R-3AQ)²⁺ and (S-3AQ)²⁺ can be easily embedded
in ammonium halide cages because of
their suitable geometric sizes and exact charge
balancing. At room temperature, all these enantiomorphic
perovskites crystallize in the polar and
chiral space group P21. The corresponding enantiomorphism
relationships are verified both from
single-crystal x-ray diffraction and vibrational
circular dichroism (VCD) spectra (Fig. 4, C and
D, and figs. S40 to S43). As expected, the infrared
spectra of both enantiomorphic crystals
are almost the same, whereas the corresponding
VCD spectra are nearly mirror images. Hence,
the introduction of organic components can lead
to chemical diversity and chiral centers that may
be impossible to achieve with inorganic perovskites.
Homochiral molecules will form the enantiomorphic
crystals of corresponding handedness
that are easy to crystallize in the five enantiomorphic
polar point groups (C1, C2, C3, C4, and C6).
Combining the high symmetry of 3D networks,
this system can be regarded as an ideal target for
the development of new multiaxial ferroelectrics,
which hold great potential for use in thin-film
devices because of their minimum requirements
for crystallinity and specific substrates. Meanwhile,
the optically active chiral compounds have
a wide range of industrial and commercial applications,
from uses in pharmaceutical and biological
industries to applications as electro-optical
elements (49, 50), and ferroelectric optical activity
may have a variety of potential applications.


Summary 


We successfully designed a family of metal-free
ABX3-type 3D perovskites with high phase
transition temperatures that are promising targets
for new ferroelectric materials. In particular,
MDABCO-NH4I3 has a large spontaneous polarization
(Ps = 22 μC/cm²) close to that of the
inorganic perovskite ferroelectric BTO (Ps =
26 μC/cm²) and a high phase transition temperature
(T0 = 448 K) beyond that of BTO (390 K).
The room-temperature processing not only reduces
cost but also broadens the application potential
by avoiding fracture during high-temperature
annealing. Using PFM, we observed the coexistence
of various ferroelectric domains with eight polarization
directions, and we provided direct experimental
proof that a flexible rotation of the
polarization direction can occur in MDABCO-NH4I3
through the application of an electric
field. The family of organic perovskites resembles
inorganic perovskite ferroelectrics PbTiO3, BaTiO3,
and BiFeO3, as they can be optimized by using
the Curie symmetry superposition principle and
crystal engineering strategy. We believe that this
class of materials, combining the outstanding ferroelectric
properties and application advantages
of molecular materials, holds great potential for
the next generation of MEMS, flexible devices,
wearable devices, and bionics.

Fig. 4. Crystal structures and related chirality of the metal-free 3D perovskite ferroelectric
family. Packing diagrams of (A) ODABCO-NH4Br3 and (B) rac-3AP-NH4Br3 in the ferroelectric
phases. Crystal structures and measured VCD spectra of (C) S-3AP-NH4Br3 and R-3AP-NH4Br3
and (D) S-3AQ-NH4Br3 and R-3AQ-NH4Br3 in the ferroelectric phases. (Crystal structures of
the analogs with Cl or I anions are shown in figs. S5 to S7.)


REFERENCES AND NOTES 


1. G. Rose, Ann. Phys. 124, 551-573 (1839).
2. K. Hirose, R. Sinmyo, J. Hernlund, Science 358, 734-738 (2017).
3. H. R. Wenk, A. Bulakh, Minerals: Their Constitution and Origin (Cambridge Univ. Press, 2016).
4. J. F. Scott, Science 315, 954-959 (2007).
5. L. G. Tejuca, J. Fierro, Properties and Applications of Perovskite-Type Oxides (CRC Press, 2000).
6. L. Hu, S. Dalgleish, M. M. Matsushita, H. Yoshikawa, K. Awaga, Nat. Commun. 5, 3279 (2014).
7. T. Akutagawa et al., Nat. Mater. 8, 342-347 (2009).
8. J. F. Scott, C. A. Paz de Araujo, Science 246, 1400-1405 (1989).
9. M. E. Lines, A. M. Glass, Principles and Applications of Ferroelectrics and Related Materials (Clarendon Press, ed. 1, 1977).
10. F. Jona, G. Shirane, Ferroelectric Crystals (Pergamon, 1962), vol. 1.
11. S. Brittman, G. W. Adhyaksa, E. C. Garnett, MRS Commun. 5, 7-26 (2015).
12. J. P. Correa-Baena et al., Science 358, 739-744 (2017).
13. Y. M. You et al., Science 357, 306-309 (2017).
14. N. Arora et al., Science 358, 768-771 (2017).
15. A. Kojima, K. Teshima, Y. Shirai, T. Miyasaka, J. Am. Chem. Soc. 131, 6050-6051 (2009).
16. B. Saparov, D. B. Mitzi, Chem. Rev. 116, 4558-4596 (2016).
17. J. Berry et al., Adv. Mater. 27, 5102-5112 (2015).
18. H. Cho et al., Science 350, 1222-1225 (2015).
19. Y.-H. Kim, H. Cho, T.-W. Lee, Proc. Natl. Acad. Sci. U.S.A. 113, 11694-11702 (2016).
20. W. Li et al., Nat. Rev. Mater. 2, 16099 (2017).
21. W. J. Xu et al., J. Am. Chem. Soc. 139, 6369-6375 (2017).
22. Q. Pan et al., J. Am. Chem. Soc. 139, 3954-3957 (2017).
23. P. Jain, N. S. Dalal, B. H. Toby, H. W. Kroto, A. K. Cheetham, J. Am. Chem. Soc. 130, 10450-10451 (2008).
24. P. Jain et al., J. Am. Chem. Soc. 131, 13625-13627 (2009).
25. D. W. Fu et al., Angew. Chem. Int. Ed. 50, 11947-11951 (2011).
26. C. A. Bremner, M. Simpson, W. T. A. Harrison, J. Am. Chem. Soc. 124, 10960-10961 (2002).
27. L. A. Paton, W. T. Harrison, Angew. Chem. Int. Ed. 49, 7684-7687 (2010).
28. Materials and methods are available as supplementary materials.
29. V. M. Goldschmidt, Naturwissenschaften 14, 477-485 (1926).
30. G. Kieslich, S. Sun, A. K. Cheetham, Chem. Sci. 5, 4712-4715 (2014).
31. K. Aizu, J. Phys. Soc. Jpn. 27, 387-396 (1969).
32. D. Kleinman, Phys. Rev. 126, 1977-1979 (1962).
33. C. B. Sawyer, C. Tower, Phys. Rev. 35, 269-273 (1930).
34. S. Horiuchi, Y. Tokura, Nat. Mater. 7, 357-366 (2008).
35. D. W. Fu et al., Science 339, 425-428 (2013).
36. H. L. Cai et al., Phys. Rev. Lett. 107, 147601 (2011).
37. R. E. Cohen, Nature 358, 136-138 (1992).
38. R. Comes, M. Lambert, A. Guinier, Solid State Commun. 6, 715-719 (1968).
39. W. Zhang, Y. Cai, R. G. Xiong, H. Yoshikawa, K. Awaga, Angew. Chem. Int. Ed. 49, 6608-6610 (2010).
40. H. Y. Ye et al., J. Am. Chem. Soc. 138, 13175-13178 (2016).
41. P. F. Li et al., Nat. Commun. 7, 13635 (2016).
42. J. H. Lee, N. C. Bristowe, P. D. Bristowe, A. K. Cheetham, Chem. Commun. 51, 6434-6437 (2015).
43. J. H. Lee, J. H. Lee, E. H. Kong, H. M. Jang, Sci. Rep. 6, 21687 (2016).
44. G. A. Jeffrey, W. Saenger, Hydrogen Bonding in Biological Structures (Springer-Verlag, 1991).
45. D. Lee et al., Science 349, 1314-1317 (2015).
46. S. V. Kalinin et al., Annu. Rev. Mater. Res. 37, 189-238 (2007).
47. D. A. Bonnell, S. V. Kalinin, A. Kholkin, A. Gruverman, MRS Bull. 34, 648-657 (2009).
48. V. Garcia et al., Nature 460, 81-84 (2009).
49. P. L. Polavarapu, Chirality 14, 768-781 (2002).
50. M.-R. Goldsmith, N. Jayasuriya, D. N. Beratan, P. Wipf, J. Am. Chem. Soc. 125, 15696-15697 (2003).


ACKNOWLEDGMENTS 


We thank Y. Yan (University of Toledo) for help on DFT calculations. 
Funding: This work was supported by the 973 project (2014CB932103), 
the National Natural Science Foundation of China (91422301, 
91622113, 21573041, 21574094, 21427801), and the Natural 
Science Foundation of Jiangsu Province (BK20150596 and 
BK20160029). Author contributions: H.-Y.Y. performed 
crystallographic analysis. Y.-Y.T. carried out the PFM study. 
P.-F.L., H.C., and P.-P.S. synthesized the sample. W.-Q.L. carried out 
ferroelectric characterization. J.-X.G. performed theoretical
calculation. X.-N.H. carried out general characterizations. Y.-M.Y. 
and R.-G.X. conceived of the study and wrote the manuscript with 
input from other authors. Competing interests: All authors 
declare that they have no competing interests. Data and 
materials availability: All data are available in the main text or 
the supplementary materials. 


SUPPLEMENTARY MATERIALS 
www.sciencemag.org/content/361/6398/151/suppl/DC1 
Materials and Methods 

Supplementary Text 

Figs. S1 to S43

Tables S1 to S12 

References (51-109) 


12 January 2018; accepted 6 June 2018 
10.1126/science.aas9330 




RESEARCH 


SYNTHETIC BIOLOGY 


Programming self-organizing 
multicellular structures with 
synthetic cell-cell signaling 


Satoshi Toda, Lucas R. Blauch, Sindy K. Y. Tang,
Leonardo Morsut, Wendell A. Lim


A common theme in the self-organization of multicellular tissues is the use of cell-cell 
signaling networks to induce morphological changes. We used the modular synNotch 
juxtacrine signaling platform to engineer artificial genetic programs in which specific 
cell-cell contacts induced changes in cadherin cell adhesion. Despite their simplicity, these 
minimal intercellular programs were sufficient to yield assemblies with hallmarks of 
natural developmental systems: robust self-organization into multidomain structures, 
well-choreographed sequential assembly, cell type divergence, symmetry breaking, and the 
capacity for regeneration upon injury. The ability of these networks to drive complex 
structure formation illustrates the power of interlinking cell signaling with cell sorting: 
Signal-induced spatial reorganization alters the local signals received by each cell, 
resulting in iterative cycles of cell fate branching. These results provide insights into
the evolution of multicellularity and demonstrate the potential to engineer customized
self-organizing tissues or materials.


uring the development of multicellular 

organisms, tissues self-organize into the 

complex architectures essential for proper 

function. Even with minimal external in- 

structions, cells proliferate, diverge into 
distinct cell types, and spatially self-organize 
into complex structures and patterns. Such self- 
organized structures are radically different from 
most human-made structures, because they are 
not assembled from preexisting parts that are 
physically linked according to a defined Carte- 
sian blueprint. Rather, these structures emerge 
through a series of genetically programmed se- 
quential events. To test and better develop our 
understanding of the principles governing mul- 
ticellular self-organization, it would be power- 
ful to design synthetic genetic programs that 
could direct the formation of custom multicellular 
structures (1-7). 

Extensive studies of natural developmental 
programs have implicated many genes that 
control cell-cell signaling and cell morphology. 
Despite their molecular diversity, a common 
theme in these developmental systems is the 
use of cell-cell signaling interactions to condi- 
tionally induce morphological responses (8, 9). 
Thus, we explored whether simple synthetic 
circuits in which morphological changes are 
driven by cell-cell signaling interactions could 
suffice to generate self-organizing multicellular 
structures. 


A simple toolkit for engineering 
morphological programs 


As a modular platform for engineering new, or- 
thogonal cell-cell signaling networks, we focused 
on using the synthetic notch (synNotch) receptor 
system (Fig. 1A). SynNotch receptors contain the 
core regulatory domain of the juxtacrine signal- 
ing receptor Notch, linked to a chimeric extra- 
cellular recognition domain (e.g., single-chain 
antibody) and a chimeric intracellular transcriptional 
domain (10). When it recognizes its cognate 
ligand on a neighboring cell, the synNotch 
receptor undergoes cleavage of the transmem- 
brane region, releasing the intracellular tran- 
scriptional domain to enter the nucleus and drive 
the expression of user-specified target genes. 
Thus, we can design synthetic cell-cell com- 
munication programs using synNotch circuits. 
SynNotch receptor-ligand pairs do not cross-talk 
with native signaling pathways such as Notch- 
Delta, or with one another, as long as they have 
different recognition and transcriptional do- 
mains. Here, we used two synNotch receptor- 
ligand pairs—an anti-CD19 single-chain antibody 
(scFv) receptor paired with CD19 ligand, and an 
anti-green fluorescent protein (GFP) nanobody 
receptor paired with surface GFP ligand—as or- 
thogonal cell-cell communication channels. 

We created potential developmental programs 
by linking synNotch signaling to two possible 
transcriptional outputs: (i) expression of specific 
cadherin molecules (E-, N-, and P-cadherins), which 
lead to homotypic cell-cell adhesion and differ- 
ential sorting of cells expressing different classes 
of adhesion molecules (11-13); and (ii) expression 
of new synNotch ligands (Fig. 1A). Morphological 
sorting driven by cadherin expression can change 
what cells are next to each other, thus altering 
what synNotch signals will or will not be trans- 
mitted. Similarly, expression of new synNotch 
ligands can also create a subsequent stage of new 
cell-cell signals. Consequently, both of these out- 
puts can propagate regulatory cascades by gen- 
erating new signaling interactions between cells 
in the collective assembly. 

We also constructed the synNotch circuits so 
that they drive expression of different fluorescent 
proteins, allowing color to indicate “differentia- 
tion” into new cell types (Fig. 1B). We expressed 
these synNotch circuits in mouse L929 fibroblasts, 
placed the cells in a low-adhesion U-bottom well 
(14), and followed their organization over time by 
fluorescence microscopy. L929 cells do not self- 
organize; normally, they only form a loose and 
randomly organized multicellular aggregate. We 
then tested whether any of the synthetic circuits 
we constructed from this small set of components 
could drive higher-order self-organization. 


Engineering interacting cells that 
self-organize into a two-layer structure 


We first focused on engineering two cell types 
that, when mixed, might communicate with and 
activate one another to induce the formation of a 
self-organized structure. We engineered a sender 
cell that expresses the synNotch ligand CD19 and 
blue fluorescent protein (BFP) (cell A) and a re- 
ceiver cell that expresses the cognate anti-CD19 
synNotch receptor and its response element 
(cell B). To induce cell sorting as an output of 
synNotch signaling, we placed the E-cadherin 
(Ecad) and GFP genes under the control of the 
synNotch-responsive promoter in the receiver 
cells (cell B in Fig. 2A). The circuit is represented 
by the following scheme: 


[cell A: CD19] → [cell B: αCD19 synNotch → Ecad^hi + GFP] 


As predicted, when cocultured with A-type 
sender cells, B-type receiver cells were acti- 
vated to express Ecad and GFP (C-type cell 
phenotype). Subsequently, the green (GFP) 
C-type cells self-sorted to form a tight inner 
core, resulting in a well-defined two-layer struc- 
ture (Fig. 2, B and C). Without induction of 
Ecad, the A- and B-type cells remained well 
mixed (fig. S1A). When synNotch signaling 
was inhibited by the γ-secretase inhibitor 
(2S)-N-[(3,5-difluorophenyl)acetyl]-L-alanyl-2-phenylglycine 
1,1-dimethylethyl ester (DAPT), 
which blocks synNotch cleavage and signaling, 
sorting into two layers did not occur, as the 
inhibitor blocked the Ecad induction response in 
B-type cells (fig. S1B). 
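
The sorting step can be illustrated with a toy lattice model. The following is a minimal sketch, not the authors' method: it assumes a square lattice completely filled with two cell types, hypothetical adhesion strengths in arbitrary units, and a greedy swap rule that keeps any exchange of two cells that does not raise the total adhesion energy. Under these assumptions, cells that adhere to one another (C-type, Ecad-expressing) collect into compact clusters, while non-adhesive cells (A-type) are displaced.

```python
import random

# Hypothetical adhesion strengths (arbitrary units), not measured values.
# "C" = induced receiver cell expressing E-cadherin, "A" = sender cell without it.
ADHESION = {("C", "C"): 1.0, ("A", "C"): 0.0, ("C", "A"): 0.0, ("A", "A"): 0.0}


def neighbors(i, j, n):
    """Yield the in-bounds 4-neighbors of lattice site (i, j)."""
    for di, dj in ((1, 0), (-1, 0), (0, 1), (0, -1)):
        if 0 <= i + di < n and 0 <= j + dj < n:
            yield i + di, j + dj


def site_energy(grid, i, j):
    """Energy of all bonds touching site (i, j); lower means more adhesion."""
    n = len(grid)
    return -sum(ADHESION[(grid[i][j], grid[a][b])] for a, b in neighbors(i, j, n))


def total_energy(grid):
    """Total energy; every bond is seen from both ends, hence the factor 1/2."""
    n = len(grid)
    return 0.5 * sum(site_energy(grid, i, j) for i in range(n) for j in range(n))


def sort_cells(grid, trials, rng):
    """Greedy sorting: swap two random cells unless the swap raises the energy."""
    n = len(grid)
    for _ in range(trials):
        i, j, k, l = (rng.randrange(n) for _ in range(4))
        if grid[i][j] == grid[k][l]:
            continue
        before = site_energy(grid, i, j) + site_energy(grid, k, l)
        grid[i][j], grid[k][l] = grid[k][l], grid[i][j]
        after = site_energy(grid, i, j) + site_energy(grid, k, l)
        if after > before:  # undo swaps that break adhesive contacts
            grid[i][j], grid[k][l] = grid[k][l], grid[i][j]
    return grid


def random_mixture(n, n_c, rng):
    """Return an n x n grid holding n_c 'C' cells and 'A' cells elsewhere."""
    cells = ["C"] * n_c + ["A"] * (n * n - n_c)
    rng.shuffle(cells)
    return [cells[r * n:(r + 1) * n] for r in range(n)]
```

Setting the C-C adhesion to zero in `ADHESION` removes any energetic reason to swap, which loosely mirrors the well-mixed outcome seen without Ecad induction.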


Using a bidirectional signaling cascade 
to engineer a self-organizing 
three-layer structure 


To create a more complex structure, we added an 
additional layer of reciprocal cell-cell signaling to 
the above two-layer circuit (Fig. 2D). We engineered 
the receiver (B-type) cell so that in addition to 
inducibly expressing Ecad, it also inducibly 
produced surface-tethered GFP as a synNotch 
ligand (GFPlig). This modified form of GFP is 
constructed by fusion with the platelet-derived 
growth factor receptor transmembrane (PDGFR 
TM) domain (10). Surface-tethered GFP served as 
both a fluorescent reporter of the new cell type 
and a ligand for a secondary synNotch receptor 
with the cognate anti-GFP nanobody binding 
domain. In the sender cells, which constitutive- 
ly express BFP and CD19 ligand, we also ex- 
pressed the anti-GFP synNotch receptor, which 
when induced would drive expression of a low 
amount of E-cadherin (Ecad^lo) fused with an 
mCherry reporter for visualization. Thus, the 
interaction between this pair of cell types can 
in principle yield a two-step cascade of re- 
ciprocal signaling: In the first step, CD19 on 
cell A activates anti-CD19 synNotch in cell B to 
induce expression of a high level of E-cadherin 
(Ecad^hi) and GFPlig. In the second step, the 
GFPlig on cell B can reciprocally activate the 
anti-GFP synNotch receptors in neighboring A- 
type cells to induce a low level of E-cadherin 
alongside the mCherry reporter. In this case, 
the A-type cell starts out as a sender cell but 
later becomes a receiver cell. The circuit is 
represented by the following scheme: 


[cell A: CD19; αGFP synNotch] 
→ [cell B: αCD19 synNotch → Ecad^hi + GFPlig] 
→ [cell A: αGFP synNotch → Ecad^lo + mCherry] 


This circuit was predicted to form a three-layer 
structure: a green internal core (Ecad^hi + GFP) 
with the highest homotypic adhesion, an outer 
layer of blue cells (no Ecad), and a new population 
of red (Ecad^lo + mCherry) cells at a middle 
interface layer (Fig. 2E). We first engineered 
and established cell A and cell B lines from 
single-cell clones, and then confirmed that they 
showed synNotch-driven expression of high or 
low amounts of Ecad along with the appropriate 
marker fluorescent proteins (fig. S2A). 
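
The signaling logic of this two-step cascade can be written as a small rule-based update. This is a minimal sketch under stated assumptions: cells sit in a one-dimensional row, signaling acts only between immediate neighbors, activation is all-or-none, and no cell sorting takes place. The state labels C and D follow the naming used in Fig. 2E; the function name `step` is hypothetical.

```python
def step(cells):
    """One synchronous round of contact-dependent signaling on a row of cells.

    States: "A" sender (CD19, blue), "B" naive receiver,
            "C" activated B (Ecad-hi + GFPlig, green),
            "D" activated A (Ecad-lo + mCherry, red).
    """
    def touching(i, kinds):
        # True if either immediate neighbor of cell i is in one of the given states.
        return any(0 <= j < len(cells) and cells[j] in kinds for j in (i - 1, i + 1))

    out = []
    for i, c in enumerate(cells):
        if c == "B" and touching(i, {"A", "D"}):  # CD19 is constitutive on A and D
            out.append("C")
        elif c == "A" and touching(i, {"C"}):     # GFPlig is presented only by C
            out.append("D")
        else:
            out.append(c)
    return out


row = ["A", "A", "A", "B", "A", "A"]
after_one = step(row)        # the B cell is activated by its A neighbors
after_two = step(after_one)  # A cells touching the new C cell are activated
print(after_one)
print(after_two)
```

In this sketch only the A cells in direct contact with a C cell change state, so one genotype (A) ends up in two phenotypes (A and D) depending on position.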

When we cocultured 200 A-type cells and 
40 B-type cells, a three-layer structure was ro- 
bustly formed, with a development process that 
required ~20 hours to fully unfold (Fig. 2F, fig. 
S2B, and movie S1). The structure emerged in a 
stereotypical stepwise fashion: induction of the 
green cells, sorting to form an inner core, and 
then the formation of a red middle layer. Here, 
the cascade of cell sorting and reciprocal signal- 
ing from the green core cells drives fate branch- 
ing of the original A-type sender cells into two 
distinct fates (red and blue). Thus, this program 
has substantial ordering power: (i) The program 
generates an increased number of cell types (two 
cell genotypes become three phenotypic cell 
types), and (ii) the program leads to spatial 
sorting into three distinct compartments. This 
change represents a decrease in entropy relative 
to the starting point of a random mixture of two 
cell types, as shown in the cell lineage map (Fig. 
2E). Many of these features of increased 
self-ordering observed in this engineered assembly 
mimic the behavior of natural developmental 
systems, such as the simple formation of dis- 
tinct progenitor cell types in early embryogen- 
esis (15, 16). 

The observed self-organization could be blocked 
by disrupting either synNotch signaling or cadherin 
expression. When we blocked cell-cell signaling 
with an inhibitor of synNotch signaling (DAPT), 
we observed no increase in cell types and no cell 
sorting into distinct layers (Fig. 2G and fig. S3B). 
When we removed E-cadherin expression from 
the system (fig. S3A), the assembled cells induced 
expression of the GFP and mCherry markers, but 
the different cell types remained randomly mixed 
(Fig. 2G). Thus, the interlinking of signaling and 
cell sorting is required for cell fate divergence 
and spatial ordering. 


Synthetic assembly is robust, reversible, 
and self-repairing 


To see how reproducibly the synthetic cell-cell 
signaling program could drive three-layer for- 
mation, we followed 28 independent replicate 
cocultures starting with 200 A-type cells and 
40 B-type cells (Fig. 3A). In most wells (57%), 
cells formed a single three-layer spheroid. In 
other wells, we observed “twin” multicore three- 
layer spheroids (21%) or multiple (separate) 
three-layer spheroids in the same well (11%). 


Fig. 1. Engineering cell-cell communication networks to program synthetic morphogenesis. 
(A) Design logic underlying our synthetic morphogenesis circuits. Engineered cell-cell signaling is 
used to drive changes in cell adhesion, differentiation, and production of new cell-cell signals. 
These outputs can subsequently be propagated to generate new cell-cell signaling relationships. 
(B) Molecular components used for assembly of simple morphological circuits. We used two 
synNotch ligand-receptor pairs (surface ligands CD19 and GFP) for cell signaling, three fluorescent 
proteins as markers of “differentiation,” and several cadherin molecules (expressed at different 
levels) as morphological outputs. Engineered circuits are transduced into L929 fibroblast cells, 
placed in defined numbers in low-adhesion U-bottom wells, and screened by microscopy for 
spatial self-organization. 

Thus, the overall three-layer architecture of 
green, red, and blue cells was robustly generated 
in ~90% of the cultures. A 3D reconstruction 
image of the three-layer structure is shown in 
Fig. 3B and movie S1. Three-layer formation was 
robust to variation in the initial number or ratio 
of starting cells (fig. S2C). Only when we used a 
low number of starting A-type cells did we begin 
to see formation of two-layer structures (green 
and red only), because all the A-type cells were 
converted to Ecad^lo cells (i.e., the number of A 
cells was limiting). 
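
As a quick check on how the reported percentages add up to "~90%", the arithmetic can be reproduced as follows. The per-category well counts are an assumption: the text gives only percentages, and the counts below are the integers out of 28 that round to those percentages.

```python
wells = 28

# Assumed counts (not stated in the text): the integers out of 28 wells
# that round to the reported 57%, 21%, and 11%.
counts = {"single spheroid": 16, "twin multicore": 6, "multiple spheroids": 3}

percent = {name: round(100 * n / wells) for name, n in counts.items()}
three_layer_fraction = sum(counts.values()) / wells

print(percent)
print(f"three-layer structures: {100 * three_layer_fraction:.1f}% of wells")
```

Under this assumption, 25 of 28 wells formed some three-layer structure, which is consistent with the rounded figure quoted in the text.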


Fig. 2. Engineering self-organizing multilayered spheroids. (A to C) Two-layer circuit. (A) An 
A-type sender cell expressing CD19 ligand induces a B-type receiver cell to express E-cadherin and 
GFP. (B) SynNotch cell-cell signals drive receiver cells to express E-cadherin (Ecad), which leads to 
their segregation into a central layer. The system starts with two disordered cell genotypes but 
organizes to form a structure with two distinct spatial compartments. (C) Images of the spheroid 
at 1 and 24 hours. See fig. S1 for other data. (D to G) Three-layer circuit. (D) An A-type cell can 
send signals to a B-type cell using CD19 ligand, which induces expression of E-cadherin (high 
expression) and GFPlig (surface-expressed GFP). The induced B-type cell can then send reciprocal 
signals to the A-type cell; GFPlig serves as ligand to stimulate anti-GFP synNotch receptor expressed 
in the A-type cell. This reciprocal interaction is programmed to drive a low level of E-cadherin and 
mCherry. (E) Cell fate diagram showing how this program drives a two-step differentiation process 
in which the A→B synNotch signal first drives conversion of B-type cells to C-type cells that 
self-adhere and sort to the center of the structure. The sorted C-type cells then present the C→A 
synNotch signal (driven by GFPlig) to convert spatially adjacent A-type cells into the middle-layer 
D-type cell (mCherry and low-level E-cadherin expression). A-type cells bifurcate into two 
phenotypes, depending on their spatial proximity to the C-type cells in the core of the structure. 
Here, the system starts with two disordered cell genotypes but self-organizes into three distinct cell 
phenotypes organized into three spatially distinct compartments. (F) Images from the development of 
the three-layer system from 0 to 20 hours. See fig. S2 and movie S1 for other data and time-lapse 
videos. (G) Formation of the three-layer structure is disrupted if synNotch signaling is inhibited (using 
DAPT, a γ-secretase inhibitor) or if cadherins are not driven as outputs. See fig. S3 for more information. 

In many cases, natural self-organized tissues 
have an ability to regenerate after injury (17). To 
test how this three-layer structure would respond 
to injury, we cut the structure into two fragments 
with a microfluidic guillotine system (18) (movie 
S2). Immediately after cleavage, the GFP-positive 
core cells were exposed to the surface, but within 
24 hours, the green core cells were re-enveloped 
by the red layer, regenerating the spherical three- 
layer structure (Fig. 3C). To further test the reversibility 
of the self-assembled three-layer structure, 
we added the synNotch inhibitor DAPT to preformed 
structures. The layered structure and distinct 
cell types were totally disrupted within 3 days 
of treatment; hence, this dynamically maintained 
structure can be disassembled by turning off cell- 
cell signaling (Fig. 3D). 


Fig. 3. Three-layer self-organized structure is robust, reversible, and 
self-repairing. (A) Distribution of structures generated in 28 independent 
wells (starting with 200 A-type cells and 40 B-type cells). About 90% of 
the wells showed formation of three-layer structures; the majority of these 
showed one spheroid per well, with the remainder showing either twinned 
spheroids or multiple independent three-layer spheroids. Example images of 
these structural subtypes are shown at the right. (B) Three-dimensional 
confocal reconstruction of a three-layer structure cross section, shown 
from two views. See movie S1 for full rotational view of the 3D structure. 
(C) Self-repair of a cleaved three-layer structure. The preformed spheroid 
was cleaved using a microfluidic guillotine, and the two resulting fragments 
were observed for 25 hours. The frames at 0 hours show the two 
fragments, with a dotted line indicating the cleavage plane that exposes 
the internal core of the spheroid. Images at 25 hours show self-repair 
of the spherical three-layer structure. (D) The structure is reversible 
if treated with the synNotch inhibitor DAPT. Within 3 days, the differentiation 
and spatial organization of cells disappeared. Original A- and B-type cells 
became randomly organized. 

Fig. 4. Single-genotype circuit that induces fate bifurcation and spatial 
ordering into a two-layer structure. (A) Design of single-genotype circuit 
with lateral inhibition between sender (CD19+) and receiver (anti-CD19 
synNotch-activated) states. The cell encodes both CD19 and anti-CD19 
synNotch, but activated synNotch receptor drives expression of tet repressor 
(tTS), which inhibits CD19 expression. Thus, neighboring cells will drive 
each other into opposite states indicated by red and green fluorescent markers 
(fate RED and GREEN). (B) E-cadherin expression driven from the 
synNotch-activated promoter. An initially homogeneous population of red cells 
undergoes bifurcation into RED fate and Ecad-positive GREEN fate by lateral 
inhibition, and GREEN-fate cells are finally sorted inside to form an inner core. 
The system starts with a single-genotype population but is expected to organize 
into a two-layer structure. (C) Purification of a homogeneous population by 
sorting for mCherry^high/GFP^low cells. When allowed to communicate through 
lateral inhibition, the cells rebifurcate into two distinct fluorescently labeled 
populations (bottom). See fig. S4 and supplementary materials for more 
information on how the lateral inhibition circuit was constructed and 
executed. (D) Development of the single-genotype two-layer structure. 
Time frames are shown at 1, 25, and 50 hours, showing initial cell 
fate bifurcation followed by formation of a stable two-layer structure. 
See fig. S5 for more information and movie S3 for time-lapse video. 

Fig. 5. Programming spherically asymmetric 
structures by inducing differentially sorting 
adhesion molecules. (A) Logic of deploying 
alternative adhesion outputs to generate different 
spatial structures. In the spherically symmetric 
structures of Figs. 2 to 4, we used high and low 
levels of Ecad expression to define different 
populations of cells. High- and low-Ecad 
populations lead to sorting into concentric shells, 
because Ecad^lo cells still prefer to bind Ecad^hi cells. 
In contrast, two cell populations that express 
either Ncad or Pcad will sort into distinct 
compartments (nonconcentric) because each of these 
cadherins prefers homotypic self-association to 
heterotypic cross-association. (B) Three-layer 
asymmetric circuit I, with the same architecture as 
that shown in Fig. 2, except that B-type cells are 
induced to express Ncad and A-type cells are 
induced to express Pcad. In phase II of the 
development (reciprocal B→A signaling), the 
A-type cells become red and self-sort to form one 
to three external poles (with unactivated A-type 
cells associated at their periphery). The starting 
population included 100 cells of each type. 
When we started with only 30 cells of each type 
(right image), we reproducibly generated 
single-pole structures. See fig. S7 and movie S4 for 
more information, time-lapse videos, and 3D 
structure. (C) Three-layer asymmetric circuit II. 
An A-type cell constitutively expresses Pcad and 
mCherry as well as CD19 ligand. B-type cells 
recognize CD19 with anti-CD19 synNotch receptor, 
which drives expression of Ncad and GFPlig. In 
reciprocal signaling, GFPlig drives induction of a 
BFP marker in A-type cells. Here, the red A-type 
cells first form a central core and the induced 
green B-type cells form polar protrusions. A third 
cell type (blue) forms at the boundary between 
the red core and the green protrusions. See fig. 
S8 and movie S5 for more information, 
time-lapse video, and 3D structure. Information on 
other structures using different cadherin pairs is 
shown in figs. S9 and S10 and movies S6 and S7. 


A single-genotype circuit that induces 
cell fate bifurcation and spatial 
ordering into a two-layer structure 


We also wanted to explore whether we could 
program self-organizing structures that could 
start from a single cell type. Alternative bistable 
cell fates can be generated from a single starting 
cell genotype through a mechanism known 
as lateral inhibition (19). For example, cross- 
repression between Notch receptor and its lig- 
and in neighboring cells can result in a bistable, 
checkerboard fate pattern, where individual 
cells bifurcate into either Notch^active-ligand^low or 
Notch^inactive-ligand^high states (20, 21). We built 
an analogous lateral inhibition circuit using 
synNotch cross-repression in L929 cells (fig. 
S4B). Each cell encoded both CD19 (ligand) and 
the anti-CD19 synNotch receptor, but these are 
antagonistic to each other because the synNotch 
receptor induces expression of the Tet repressor 
(tTS), which can repress CD19 expression (con- 
trolled by a TetO promoter). Thus, if synNotch 
is stimulated by a neighboring cell with high 
CD19 expression, it will repress CD19 ligand 
expression, thereby forcing cells to choose between 
either a sender or receiver fate. CD19 and 
tTS expression were monitored by mCherry and 
GFP, respectively (expressed in linked transcrip- 
tional cassettes through a ribosomal skipping 
porcine teschovirus-1 2A sequence). We es- 
tablished multiple clones that bifurcated spon- 
taneously into two populations of mCherry or 
GFP-positive cells (fig. S4B; see supplementary 
materials for details of how we established lat- 
eral inhibition lines). These cell lines consist- 
ently reestablished the two phenotypic states, 
even when starting with a pure sorted popula- 
tion of either the red or green state (Fig. 4C and 
fig. S4B). 
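
The bistability produced by this kind of cross-repression can be illustrated with a generic two-cell model. This is a minimal sketch, not the model used in the paper: the Hill-type equations, the parameters `a` and `b`, and the scaled variables are illustrative assumptions chosen only so that the symmetric state is unstable. Each cell's synNotch-driven tTS level rises with its neighbor's CD19, and each cell's CD19 level falls as its own tTS rises.

```python
def simulate(cd19_init=(0.51, 0.49), t_end=100.0, dt=0.01, a=0.01, b=100.0):
    """Euler integration of a generic two-cell lateral inhibition model.

    cd19[i]: CD19 ligand level of cell i (scaled to 0..1)
    tts[i]:  synNotch-driven tTS repressor level of cell i (scaled to 0..1)
    a, b:    hypothetical Hill-type parameters (assumptions, not fitted values)
    """
    cd19 = list(cd19_init)
    tts = [0.0, 0.0]
    for _ in range(round(t_end / dt)):
        new_cd19, new_tts = [], []
        for i in (0, 1):
            j = 1 - i
            # The neighbor's CD19 activates this cell's synNotch, inducing tTS.
            activation = cd19[j] ** 2 / (a + cd19[j] ** 2)
            # This cell's tTS represses its own CD19 expression.
            repression = 1.0 / (1.0 + b * tts[i] ** 2)
            new_tts.append(tts[i] + dt * (activation - tts[i]))
            new_cd19.append(cd19[i] + dt * (repression - cd19[i]))
        cd19, tts = new_cd19, new_tts
    return cd19, tts


cd19, tts = simulate()
print("CD19:", [round(x, 2) for x in cd19])
print("tTS: ", [round(x, 2) for x in tts])
```

With these assumed parameters, a 2% initial difference in CD19 is amplified until one cell settles into a CD19-high, tTS-low (sender) state and the other into a CD19-low, tTS-high (receiver) state.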

To produce a spatially ordered structure from 
a single cell type, we then functionally combined 
two different organizational circuit modules: this 
bifurcating cell fate circuit and the self-organized 
E-cadherin-driven two-layer circuit (Fig. 2A). To 
construct such a composite circuit, we expressed 
E-cadherin from the synNotch-driven promoter 


(in addition to inducing expression of the tet 
repressor) (Fig. 4A and fig. S5A). The objective 
was to start with a single cell type and observe 
self-driven fate bifurcation followed by self- 
driven sorting into two layers. 

To track how the system developed from a 
single cellular phenotype, we sorted red-fate 
cells (CD19^high), placed 100 cells in each well, 
and followed the development of the spheroid 
by time-lapse microscopy. These cells developed 
into a spheroid in which the cells first underwent 
bifurcation into a red-green checkerboard pat- 
tern and then, over the course of hours, formed 
a two-layer structure with green cells inside and 
red cells outside (fig. S5, B and C). These two- 
layer structures were stable for 100 hours. 
Addition of the Notch signaling inhibitor DAPT 
prevented fate bifurcation (fig. S5C). But after 
removal of the drug and re-sorting, the cells remained 
bipotent; they could still bifurcate and 
re-form the two-layer structure (Fig. 4D, fig. S5C, 
and movie S3). Thus, we can engineer synthetic 
programs in which a single cell genotype bifurcates 
and spatially self-organizes into multiple layers. 


Programming spherically asymmetric 
structures by inducing differentially 
sorting cell adhesion molecules 


Another key feature of natural morphogenesis is 
symmetry breaking, used repeatedly during de- 
velopment to generate body axes and elaborate 
an initially uniform ball of cells (22, 23). The struc- 
tures described above are all spherically symmet- 
ric, but we could program asymmetric structure 
formation with the same signaling cascade cir- 
cuit by simply changing the adhesion molecules 
that were expressed. 

To build the spherically symmetric three-layer 
structure described above (Fig. 2D), we program- 
med different subsets of cells to express dif- 
ferent amounts of the same adhesion molecule 
(E-cadherin), which generates spherically symmetric 
concentric layers (because Ecad^lo cells still 
prefer to interact with Ecad^hi cells; see relative 
interaction energies in Fig. 5A). However, if cells 
express different cadherins that have high homo- 
typic affinity but low heterotypic affinity, they 
phase-separate into two spatially distinct populations 
(Fig. 5A). N-cadherin (Ncad) and P-cadherin 
(Pcad) have high homotypic affinity (Ncad-Ncad 
and Pcad-Pcad) but low heterotypic affinity (Ncad- 
Pcad) (24), so we used the combination of Ncad 
and Pcad expression to try to drive asymmetric 
sorting and structure formation (fig. S6). 
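
The contrast between concentric and side-by-side sorting can be made concrete with the classification usually associated with Steinberg's differential adhesion hypothesis. This is a minimal sketch: the function name and the numeric adhesion values are hypothetical (arbitrary units, not measurements from this work), and the thresholds are the commonly stated criteria comparing heterotypic adhesion with the two homotypic adhesions.

```python
def predicted_arrangement(w_aa, w_bb, w_ab):
    """Classify the expected arrangement of two cell populations.

    w_aa, w_bb: homotypic adhesion strengths of populations a and b
    w_ab:       heterotypic adhesion strength between a and b
    """
    strong, weak = max(w_aa, w_bb), min(w_aa, w_bb)
    if w_ab >= (strong + weak) / 2:
        return "intermixed"
    if w_ab >= weak:
        return "concentric"          # more cohesive population forms the core
    if w_ab > 0:
        return "side-by-side"        # partially separated, nonconcentric domains
    return "fully separated"


# Hypothetical values: same cadherin at high and low levels (Ecad-hi vs Ecad-lo).
print(predicted_arrangement(w_aa=1.0, w_bb=0.2, w_ab=0.4))
# Hypothetical values: two cadherins with weak cross-binding (Ncad vs Pcad).
print(predicted_arrangement(w_aa=1.0, w_bb=1.0, w_ab=0.1))
```

Under these assumed numbers the first case is classified as concentric and the second as side-by-side, which matches the qualitative distinction drawn in Fig. 5A.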

We introduced Ncad and Pcad as morphological 
outputs in the basic three-layer circuit. First, 
CD19 synNotch signaling from cell A induced 
expression of Ncad and GFPlig in cell B; second, 
the induced GFPlig on cell B reciprocally activated 
anti-GFP synNotch in the adjacent subpopulation 
of A cells, driving Pcad expression (Fig. 5B, fig. S7, 
and movie S4). When we cultured 100 cells each 
of type A and B together, we observed a stereotypical 
developmental sequence: By 13 hours, 
B-type cells expressed both Ncad and GFPlig, 
and by 21 hours, A-type cells adjacent to B-type 
cells began to express Pcad and mCherry. Because 
of the resulting self-segregation of the Ncad- and 
Pcad-expressing cells, the ensemble self-organized 
into a nonspherically symmetric three-layer struc- 
ture (green, red, blue) with one to three distinct 
poles of mCherry (Pcad) cells. A-type cells (blue) 
not activated through their anti-GFP synNotch 
receptors were associated with the outer surface 
of these poles. 

When we initiated cultures with a smaller 
number of starting cells (30 cells each of type A 
and B), the ensemble reproducibly formed a 
single-pole asymmetric structure (a single cluster 
of red cells instead of multiple clusters), consist- 
ent with many examples of polarized organization 
in which a smaller starting size minimizes the 
chance of initiation of multiple independent poles 
(Fig. 5B, fig. S7, and movie S4) (25). Thus, we 
could reliably program systems that would form 
three-layer asymmetric or polarized structures. 

We designed other circuits that induced alter- 
native types of asymmetric structures with the 
same Ncad-Pcad output combination but were 
regulated in different sequential programs. In 
the circuit shown in Fig. 5C, cell A was similar 
to the above example (it expressed CD19 ligand 
and anti-GFP synNotch receptor driving expres- 
sion of BFP), except that it also constitutively 
expressed Pcad [connected with mCherry via an 
internal ribosome entry site (IRES) sequence]. 
Cell B was the same as in Fig. 5B (it expressed 
anti-CD19 synNotch receptor that induced Ncad 
and GFPlig expression). When cultured together, 
the Pcad-expressing A cells (red) immediately 
formed an adherent aggregate (4 hours); then, 
after 14 hours, Ncad and GFPlig were expressed 
in B-type cells, leading to the formation of polarized 
B-type protrusions (green) segregated from 
the A-type cells (red). Finally, at 34 hours, A-type 
cells at the interface with B-type cells were activated 
by GFP-synNotch signaling to turn on BFP, 
resulting in a thin boundary layer of blue cells between 
the polarized red and green regions (time-lapse 
and 3D reconstruction image shown in fig. 
S8 and movie S5). Additional types of combinatorial 
circuits using different cadherin pairs are 
shown in figs. S9 and S10 and in movies S6 and S7. 

Fig. 6. Gallery of different self-organizing 
multicellular structures that can be programmed 
using the simple synNotch-adhesion toolkit. 
(A) Gallery of spatially organized behaviors 
generated in this work, organized by resulting 
number of cell types and spatially distinct 
compartments as well as by increasing 
asymmetry. See table S1 for details of the 
construction of these 12 structures. Diagrams 
of several of the different three-layer structures 
are shown schematically below. (B) These 
synthetic developmental systems share the 
common principles in which cascades of cell-cell 
signaling, linked by morphological responses, 
lead to increasing diversification of cell types. 
As signaling drives morphological changes 
and reorganization, new cell-cell interactions 
arise, resulting in increasingly distinct positional 
information encountered by each cell in 
the structure. 

These results confirmed that we can build var- 
ious self-organizing structures that break spheri- 
cal symmetry by inducing distinct self-segregating 
adhesion molecules in different subpopulations 
of cells. Initial conditions with small cell num- 
bers can reproducibly yield structures with a 
single polar axis. Moreover, we can generate many 
different three-layer morphological structures 
by altering the combinations of adhesion mole- 
cules used and by altering at what stage in the 
circuit they are expressed (Figs. 5 and 6). 


Minimal intercellular communication 
programs can drive synthetic 
self-organizing cellular structures 


Figure 6A and table S1 summarize the various 
self-organizing synthetic structures we programmed 
with our minimal logic of controlling cell adhe- 
sion (cadherin expression) through cell-cell com- 
munication (synNotch signaling). The diversity 
and complexity of these structures, and the ro- 
bustness with which they are formed, illustrate 
the ordering power of even these highly sim- 
plified cell-cell signaling programs. In all of these 
systems, we observed a cyclic sequence of events 
in which initial cell signaling interactions in- 
duced morphological rearrangements, which in 
turn generated new cell-cell interactions and new 
morphological refinements (Fig. 6B). Complex 
structures emerge because these cell-cell signaling 
cascades drive increasing cell type diversification. 

These diverse emergent structures can form 
even in the absence of many of the molecular 
components normally used in natural develop- 
mental systems. For example, these circuits do 
not incorporate diffusible morphogens for cell- 
cell communication, irreversible cell fate com- 
mitment, or direct regulation of cell proliferation, 
death, or motility (8, 26-29). It is likely that the 
synthetic platforms used here could be extended 
to include many of these additional elements to 
generate even more sophisticated engineered self- 
organizing multicellular structures (30-35). 

The observation that even minimal circuits 
that link cell-cell signaling to adhesion can lead 
to the formation of defined self-organizing struc- 
tures may help to explain the general principles 
by which multicellular organisms could have 
evolved. Choanoflagellates, the closest single- 
cell relatives of metazoans, have both primitive 
cadherin and notch genes (36). The cadherin 
genes are thought to have originally functioned 
to trap prey bacteria in the environment and 
may have later been co-opted for cell-cell adhe- 
sion (37, 38). In some choanoflagellate species, 
environmental signals from prey bacteria can 
induce the formation of multicellular assemblies 
(39, 40). It seems plausible that cell-to-bacteria 
adhesion transitioned to cell-cell adhesion, and 
that bacteria-to-cell signaling transitioned to 
cell-cell signaling. During the course of evolu- 
tion, these systems may have begun to regulate 
one another, providing a starting point for cir- 
cuits capable of driving formation of complex 
multicellular structures. 

More generally, these findings suggest that 
it may be possible to program the formation of 
synthetic tissues, organs, and other non-native 
types of dynamic, multicellular materials. We may 
be able to apply tools like synNotch, perhaps 
enhanced by an even larger toolkit of modular 
developmental signals, to construct customized 
self-assembling tissue-like biomaterials of diverse
types. These tools and approaches also provide a
powerful means to systematically probe and
better understand the principles governing
self-organization and development.
Phase transitions in a programmable
quantum spin glass simulator 


R. Harris, Y. Sato, A. J. Berkley, M. Reis, F. Altomare, M. H. Amin, K. Boothby,
P. Bunyk, C. Deng, C. Enderud, S. Huang, E. Hoskinson, M. W. Johnson,
E. Ladizinsky, N. Ladizinsky, T. Lanting, R. Li, T. Medina, R. Molavi, R. Neufeld,
T. Oh, I. Pavlov, I. Perminov, G. Poulin-Lamarre, C. Rich, A. Smirnov, L. Swenson,
N. Tsai, M. Volkmann, J. Whittaker, J. Yao


Understanding magnetic phases in quantum mechanical systems is one of the essential goals 
in condensed matter physics, and the advent of prototype quantum simulation hardware 
has provided new tools for experimentally probing such systems. We report on the 
experimental realization of a quantum simulation of interacting Ising spins on three- 
dimensional cubic lattices up to dimensions 8 x 8 x 8 on a D-Wave processor (D-Wave 
Systems, Burnaby, Canada). The ability to control and read out the state of individual 
spins provides direct access to several order parameters, which we used to determine the 
lattice’s magnetic phases as well as critical disorder and one of its universal exponents. By 
tuning the degree of disorder and effective transverse magnetic field, we observed phase 
transitions between a paramagnetic, an antiferromagnetic, and a spin-glass phase. 


Can quantum physics be efficiently simulated
on a computer? What kind of computer would
one need to do so? Feynman posed these
questions to motivate the concept
of a probabilistic quantum mechanical
computer that could be used to simulate nature 
(1). Whereas the implementation of such com- 
puters seemed daunting in Feynman’s time, sub- 
sequent progress in the laboratory has made 
small-scale quantum simulation possible today 
(2-9). Although the field of quantum simulation 
is still in its infancy, the scale and the control of 
engineered quantum systems has reached the 
point where they can begin to address funda- 
mental problems in physics. 

In this Report, we describe the use of an 
engineered quantum system to investigate the 
properties of a particular quantum magnetism 
problem known as the transverse field Ising 
model (TFIM). The TFIM is often viewed as the 
archetypal model for order-disorder phase transitions
in quantum systems (10). A diverse collection
of condensed matter systems, such as
ferroelectrics (11) and magnetic spin glasses
(12), can be described by the TFIM. Although 
this model can be analytically solved in one di- 
mension, it is generally only amenable to numer- 
ical study via computationally intensive means, 
such as quantum Monte Carlo (13), in higher 
dimensions. We have simulated the TFIM on 
three-dimensional (3D) simple cubic (SC) lattices 
up to dimensions 8 x 8 x 8 by embedding the 


¹D-Wave Systems, 3033 Beta Avenue, Burnaby, BC V5G
4M9, Canada. ²Department of Physics, Simon Fraser
University, Burnaby, BC V5A 1S6, Canada. ³Department of
Electrical and Computer Engineering, University of British
model on a D-Wave quantum processing unit
(QPU) (D-Wave Systems, Burnaby, Canada) (14).
By tuning the amount of disorder within the lat- 
tice and varying the effective transverse mag- 
netic field, we demonstrate phase transitions 
between a paramagnetic (PM), an ordered anti- 
ferromagnetic (AFM), and a spin-glass (SG) phase. 
The experimental results compare well with the- 
ory for this particular SG problem, thus validat- 
ing the use of a probabilistic quantum computer 
to simulate materials physics. This represents an
important step forward in the realization of integrated
quantum circuits at a scale that is relevant
for condensed matter research.

The system studied here is a SC lattice of
Ising spin-1/2 particles subject to a global transverse
field splitting (10). The single-spin energy
splitting induced by the transverse field is denoted
by $\Gamma$, and the magnitude of the nearest-neighbor
interaction energy is denoted by $\mathcal{J}$.
To induce glassy behavior, let the sign of the
interaction vary randomly, with $p$ and $1 - p$ denoting
the probability that a particular interaction
$J_{ij}$ be $-1$ (ferromagnetic) and $+1$ (AFM),
respectively. The Hamiltonian for this system
can be written as


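$$H_{\mathrm{SG}} = \mathcal{J} \sum_{\langle i,j \rangle} J_{ij}\, \hat{\sigma}^z_i \hat{\sigma}^z_j - \Gamma \sum_i \hat{\sigma}^x_i \qquad (1)$$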


where $\hat{\sigma}^z_i$ and $\hat{\sigma}^x_i$ are the Pauli $z$ and $x$ operators,
respectively, acting on spin $i$, and the
notation $\langle i,j \rangle$ indicates summing over all
nearest-neighbor pairs. A schematic phase diagram
for this system is depicted in Fig. 1A. For $\Gamma = 0$,
this is a well-studied classical system, with phase
transitions induced by thermal fluctuations. The
critical disorder $p_c = 0.222$ (15) marks the value
of $p$ below which one observes an ordered AFM
phase, whereas above this value one observes a
SG phase. Critical temperatures $k_B T_c^{\mathrm{AFM}}/\mathcal{J} = 4.5$
at $p = 0$ (16) and $k_B T_c^{\mathrm{SG}}/\mathcal{J} = 1.1$ at $p = 0.5$ (17),
where $k_B$ is the Boltzmann constant, indicate
the locations of PM-AFM and PM-SG thermal
phase transitions, respectively, at $\Gamma = 0$. As $\Gamma$ is
increased, phase boundaries evolve because of
the increased role that quantum fluctuations
play over thermal fluctuations. The critical transverse
field energies $\Gamma_c^{\mathrm{AFM}}/\mathcal{J} \approx 5.2$ at $p = 0$ (18)
and $\Gamma_c^{\mathrm{SG}}/\mathcal{J} \approx 3.1$ at $p = 0.5$ (19) indicate
locations of quantum phase transitions between
the PM-AFM and PM-SG phases, respectively,
at $T = 0$.
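
As a rough illustration of the model in the classical limit $\Gamma = 0$, the following Python sketch draws a random bond instance and evaluates the Ising energy of a spin configuration. It is not the authors' code: the function names, the dictionary-based data layout, and the choice of open (non-periodic) boundaries are all assumptions made for this example.

```python
import itertools
import random

def neighbor_pairs(L):
    """Yield nearest-neighbor site pairs of an L x L x L simple cubic lattice.
    Open boundaries are an assumption made for this sketch."""
    for a, b, c in itertools.product(range(L), repeat=3):
        for da, db, dc in ((1, 0, 0), (0, 1, 0), (0, 0, 1)):
            nb = (a + da, b + db, c + dc)
            if max(nb) < L:
                yield (a, b, c), nb

def random_instance(L, p, rng):
    """Assign J_ij = -1 (ferromagnetic) with probability p, else +1 (AFM)."""
    return {pair: (-1 if rng.random() < p else 1) for pair in neighbor_pairs(L)}

def classical_energy(spins, bonds, scale=1.0):
    """Energy of Eq. 1 at Gamma = 0: scale * sum over bonds of J_ij s_i s_j."""
    return scale * sum(j * spins[u] * spins[v] for (u, v), j in bonds.items())

def staggered_state(L):
    """Checkerboard configuration s_i = (-1)^(a_i + b_i + c_i)."""
    return {site: (-1) ** sum(site)
            for site in itertools.product(range(L), repeat=3)}

rng = random.Random(0)
bonds = random_instance(4, 0.0, rng)          # p = 0: every bond is AFM
energy = classical_energy(staggered_state(4), bonds)
print(energy)
```

With $p = 0$ every bond is antiferromagnetic, so the checkerboard state satisfies all bonds and its energy equals minus the number of bonds.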


Fig. 1. The bond-disordered TFIM on a simple cubic lattice. (A) Schematic phase diagram. Axes
correspond to scaled transverse magnetic field splitting $\Gamma/\mathcal{J}$, disorder $p$, and scaled temperature
$k_B T/\mathcal{J}$, where $\mathcal{J}$ is the Ising interaction strength. The experiments reported on here probed this space
on a curved manifold (translucent surface). Any single experiment for fixed $p$ involves annealing
along a trajectory on this manifold (black dotted curve) that starts in the PM phase, and a critical
point is encountered where the trajectory penetrates a phase boundary (silver point). (B) Illustration
of a ground state at $T = 0$ for an 8 × 8 × 8 lattice instance for $p = 0.1 < p_c$. Ferromagnetic ($J_{ij} = -1$)
and AFM ($J_{ij} = +1$) bonds are indicated by gold and silver cylinders, respectively. Ising spin
orientations ↑ and ↓ are represented by red and blue spheres, respectively.
The SC lattice described above was embedded
into the native graph of a prototype QPU (14).
The QPU and the embedding are described in
the supplementary materials. Lattices up to size
8 × 8 × 8 fit into the QPU, an example of which
is shown in Fig. 1B. Within this experimental
platform, the coupling energy $\mathcal{J}$ and transverse
field splitting $\Gamma$ are monotonic functions of a dimensionless
control parameter $0 \le s \le 1$ so that
$\Gamma(s = 0) \gg \mathcal{J}(s = 0)$ and $\Gamma(s = 1) \ll \mathcal{J}(s = 1)$. For
fixed temperature $T \approx 12$ mK ($k_B T/h = 250$ MHz),
the QPU can be used to explore the phase space
on a curved manifold in $(p, \Gamma/\mathcal{J}, k_B T/\mathcal{J})$-space,
as indicated in Fig. 1A. Details concerning the
functional form of this manifold can be found
in the supplementary materials.

To experimentally determine the critical disorder
$p_c$, we first worked in a classical regime by
noting that the final Hamiltonian at $s = 1$ is
given by the classical form of Hamiltonian (Eq. 1)
with $\Gamma(s = 1) \to 0$. Ensembles of up to 1000 random
instances, with each instance corresponding
to a specific set of $J_{ij}$, were generated for a series
of values of $p$ and lattice sizes $L \times L \times L$. Each
instance was programmed into the QPU, and low-energy
states of the final classical Hamiltonian
were obtained by slowly evolving from $s = 0$ to
$s = 1$ in a process known as quantum annealing
(QA) (20-22). Experimental details are available
in the supplementary materials. QA was performed
1000 times per instance, and the final
states of every spin were recorded. To
approximately estimate ground-state properties,
the results were then filtered to identify
the lowest-energy state returned by the QPU
for each instance. In cases in which the lowest
observed energy was degenerate, all such states
were kept. The result was a set of low-energy
states $|s\rangle = |s_1 s_2 \ldots s_N\rangle$, where $N = L^3$ is the number
of lattice spins, for each value of $p$ that could
then be used to determine disorder-averaged
quantities, as described below.
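
The filtering step described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: `lowest_energy_states` is a hypothetical helper, the toy `chain_energy` function stands in for the instance energy, and discarding repeated copies of the same state is a choice made here that the text does not specify.

```python
def lowest_energy_states(samples, energy_fn, tol=1e-9):
    """Return the minimum observed energy and all distinct states reaching it."""
    energies = [energy_fn(s) for s in samples]
    e_min = min(energies)
    kept = []
    for state, energy in zip(samples, energies):
        # Keep every degenerate minimum once (deduplication is an assumption).
        if energy - e_min <= tol and state not in kept:
            kept.append(state)
    return e_min, kept

def chain_energy(state):
    """Toy stand-in for the instance energy: an open antiferromagnetic chain."""
    return sum(a * b for a, b in zip(state, state[1:]))

# Four hypothetical annealing reads of a 4-spin toy problem.
reads = [(1, -1, 1, -1), (1, 1, -1, 1), (-1, 1, -1, 1), (1, -1, 1, -1)]
e_min, ground = lowest_energy_states(reads, chain_energy)
```

Here the two alternating configurations are degenerate, so both are retained, while the repeated read is counted once.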

To detect the AFM-SG phase transition as a 
function of p, we calculated the absolute value 
of the AFM magnetization (also known as stag- 
gered magnetization) 


$$|m_{\mathrm{AFM}}| = \left| \frac{1}{N} \sum_{i=1}^{N} (-1)^{a_i + b_i + c_i}\, s_i \right| \qquad (2)$$

for each state $|s\rangle$, where the vector of integers
$(a_i, b_i, c_i)$ corresponds to the position of site $i$ in
the SC lattice (relative to an arbitrary origin),
and $s_i$ is the state of the spin at site $i$. A summary
plot of $\langle |m_{\mathrm{AFM}}| \rangle$ versus $p$, where $\langle \ldots \rangle$ denotes
averaging over disorder, is shown in Fig. 2A.
Data are shown for three system sizes $L \times L \times L$,
for $L \in [4, 6, 8]$. For all system sizes, $\langle |m_{\mathrm{AFM}}| \rangle$
monotonically decreases with increasing $p$, and
the slope near $p = p_c$ becomes increasingly steep
with larger $L$. However, this quantity does not
strictly go to zero for $p > p_c$ because of finite
Fig. 2. Determination of the critical disorder $p_c$ and AFM susceptibility $\chi_{\mathrm{AFM}}$ versus quantum
annealing parameter $s$. (A) Disorder-averaged AFM magnetization $\langle |m_{\mathrm{AFM}}| \rangle$ versus $p$ for three
different lattice sizes $L \times L \times L$. (B) Binder cumulant $g$ versus $p$. For each $L$, the results near $p_c$ have
been independently fit to second-order polynomials (solid curves). The critical disorder is given
by the mutual crossing point, which is estimated to be $p_c^{\mathrm{exp}} = 0.22 \pm 0.01$ (dashed vertical line).
(C) Finite-size scaling collapse of the experimental results. Assuming that $g$ is a function of
scaled disorder $L^{1/\nu_{\mathrm{exp}}}(p - p_c^{\mathrm{exp}})$, the best collapse was obtained for $1/\nu_{\mathrm{exp}} = 0.85 \pm 0.05$. (D) $\chi_{\mathrm{AFM}}$
versus $s$ for $p < p_c$. (E) $\chi_{\mathrm{AFM}}$ versus $s$ for $p > p_c$. (F) Normalized $\chi_{\mathrm{AFM}}$ versus $s$ for $p > p_c$.
size effects (23). To determine the experimentally
observed critical point $p_c^{\mathrm{exp}}$, we calculated
the Binder cumulant (24)

$$g = 1 - \frac{\langle m_{\mathrm{AFM}}^4 \rangle}{3 \langle m_{\mathrm{AFM}}^2 \rangle^2} \qquad (3)$$

versus $p$ for $L \in [4, 6, 8]$, because this quantity
should be scale invariant at a critical point. The
results of this analysis are shown in Fig. 2B,
with independent fits of the data near $p_c$ to
second-order polynomials for each value of $L$.
From these fits, we estimate the mutual crossing
point for all three system sizes to be $p_c^{\mathrm{exp}} =
0.22 \pm 0.01$. A collapse of the data is shown in
Fig. 2C for a posited scaling function $g(p, L) =
\mathrm{Fcn}[L^{1/\nu_{\mathrm{exp}}}(p - p_c^{\mathrm{exp}})]$, from which we obtain a
critical exponent $1/\nu_{\mathrm{exp}} = 0.85 \pm 0.05$. These experimental
results are in good agreement with
the numerical results of $p_c = 0.222$ and $1/\nu = 0.9$
reported in literature (15). This observation confirms
that the embedded classical problem is in
the same universality class as the 3D cubic lattice
problem in the limit of vanishing transverse
field (25).
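
The two quantities used in this analysis are simple to compute from recorded spin states. The sketch below is illustrative only: it assumes states are stored as dictionaries keyed by lattice coordinates, and it assumes the standard Binder cumulant form $g = 1 - \langle m^4 \rangle / (3 \langle m^2 \rangle^2)$; the function names are hypothetical.

```python
import itertools

def staggered_magnetization(state):
    """|m_AFM| of Eq. 2 for a dict {(a, b, c): s} with s = +1 or -1."""
    total = sum((-1) ** (a + b + c) * s for (a, b, c), s in state.items())
    return abs(total) / len(state)

def binder_cumulant(m_values):
    """g = 1 - <m^4> / (3 <m^2>^2), averaging over the supplied values."""
    n = len(m_values)
    m2 = sum(m ** 2 for m in m_values) / n
    m4 = sum(m ** 4 for m in m_values) / n
    return 1.0 - m4 / (3.0 * m2 ** 2)

L = 4
sites = list(itertools.product(range(L), repeat=3))
ordered = {site: (-1) ** sum(site) for site in sites}   # perfect AFM order
uniform = {site: 1 for site in sites}                   # no staggered order

m_ordered = staggered_magnetization(ordered)
m_uniform = staggered_magnetization(uniform)
g_ordered = binder_cumulant([m_ordered] * 10)
```

A perfectly ordered ensemble gives $g = 2/3$, whereas a Gaussian distribution of $m$ centered on zero would give $g$ near zero, which is why curves for different $L$ are expected to cross near the critical disorder.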

We next turned to the identification of phase
transitions between the PM, AFM, and SG
phases as a function of $\Gamma/\mathcal{J}$ and $k_B T/\mathcal{J}$. Unlike
in the analysis above, we worked in the quantum
regime, where $\Gamma$ is nonzero. The PM-to-AFM
phase transition is known to be of second order
(18), thus motivating the use of a linear susceptibility
measurement that would diverge at a
critical point in the thermodynamic limit $L \to \infty$.
Because the ordered phase at $p = 0$ is AFM, we
applied a global longitudinal magnetic field that
was uniform in magnitude but alternating in
sign between nearest-neighbor sites ($\pm h$). We refer
to the quantity $\chi_{\mathrm{AFM}} = \delta m_{\mathrm{AFM}}/\delta h\,|_{h \to 0}$ as the
AFM or staggered linear susceptibility. $\chi_{\mathrm{AFM}}$ was
Fig. 3. Example spin-spin overlap probability
distributions $P(q)$. (A) Example PM-to-AFM
phase transition for a $p = 0.1$ instance at four
representative values of $s$. The results for $s = 0.1$
have been magnified ×10. (B) Example PM-to-SG
phase transition for a $p = 0.5$ instance at
four representative values of $s$.
Fig. 4. $p$-$\Gamma/\mathcal{J}$ and $k_B T/\mathcal{J}$-$\Gamma/\mathcal{J}$ phase diagrams. (A) $p$-$\Gamma/\mathcal{J}$ phase diagram computed from
$\chi_{\mathrm{AFM}}$ peak positions (black points with error bars). All measurements performed at $T = 12.0$ mK.
Thick dashed black line indicates experimentally determined critical disorder $p_c^{\mathrm{exp}}$, and the thin
dashed black lines indicate error estimates for that quantity. Critical transverse fields $\Gamma_c^{\mathrm{AFM}}/\mathcal{J}$ and
$\Gamma_c^{\mathrm{SG}}/\mathcal{J}$ for an infinite system at $T = 0$ and critical disorder $p_c$ were taken from the literature.
(B) $\chi_{\mathrm{AFM}}$ peak positions for varying relative coupling strength $\mathcal{J}$ translated into the $(\Gamma/\mathcal{J}, k_B T/\mathcal{J})$-plane
for $p = 0$. $\chi_{\mathrm{AFM}}(s)$ data are provided in fig. S13. Annealing trajectories for varying relative coupling
strength are indicated by dashed curves (supplementary materials), and the corresponding
transition points are indicated by color-coordinated points, with error estimate along each trajectory.
Trajectory and critical point denoted as 1.00× correspond to the experimental manifold depicted
in Fig. 1A and the $p = 0$ point in (A), respectively. Trajectories and points labeled 0.33×, 0.50×, and
0.67× indicate lower $\mathcal{J}$, whereas those labeled 1.33×, 1.67×, 2.00×, 2.33×, and 2.67× indicate
higher $\mathcal{J}$. Best fit of phase boundary and extrapolation to $T = 0$ is indicated by solid black curve.
Dashed black curves indicate error bounds on the best fit model. Open circles represent the
theoretical results (30) for an infinite system.


measured as a function of QA control parameter
$0 \le s \le 1$ for small sets of randomly chosen
instances for a series of values of $p$. Disorder-averaged
traces of $\chi_{\mathrm{AFM}}$ versus $s$ for 8 × 8 × 8 lattices
are presented in Fig. 2, D to F. For $p < p_c =
0.22$, this quantity yields obvious peaks that
broaden and shift to increasing $s$ with increasing
$p$ in Fig. 2D. In what follows, we refer to the
peak positions as $s_c$. For $p > p_c = 0.22$, $s_c$ remains
fixed to within experimental resolution,
but the peak amplitude continues to decrease
with increasing $p$, as shown in Fig. 2E. Furthermore,
the peak broadens until $p = 0.4$, beyond
which the breadth remains fixed. This latter
observation is made clearer in the scaled
plot of $\chi_{\mathrm{AFM}}(s)/\chi_{\mathrm{AFM}}(s_c)$ versus $s$ in Fig. 2F.

The peaks observed for $p < p_c$ suggest an
incipient divergence, which reliably indicates
a second-order phase transition between the
PM and AFM phases. For $p > p_c$, the absence of
such a divergence is in agreement with the re- 
sults of numerical simulations of the PM-to-SG 
phase transition in a related system (26). We 
consider this to be only part of the evidence 
needed to confirm that a PM-to-SG phase tran- 
sition has been observed. 

The AFM magnetization $|m_{\mathrm{AFM}}|$, as given in
Eq. 2, is an order parameter that one can use
to easily identify the AFM phase. However,
$\langle |m_{\mathrm{AFM}}| \rangle = 0$ in both the PM and SG phases.
The latter two phases may be distinguished by
using an order parameter proposed by Edwards
and Anderson (27), hereafter referred to as the
spin-spin overlap $q$, which is defined as

$$q = \frac{1}{N} \sum_{i=1}^{N} s_i^{A} s_i^{B} \qquad (4)$$

where $A$ and $B$ are independently determined
states for a given instance of disorder. Given a
statistically large number of such states, one
can construct the probability distribution $P(q)$,
whose characteristics dramatically differ between
the PM, AFM, and SG phases. Examples
of experimentally determined $P(q)$ for an AFM
($p = 0.1$) and a SG ($p = 0.5$) instance of size
8 × 8 × 8 at four values of $s$ are shown in Fig. 3, A
and B, respectively. In both cases, the results
reveal a Gaussian-shaped $P(q)$ centered at $q = 0$
for $s < s_c$, which is consistent with the PM phase
in a finite-sized system. For the AFM example in
Fig. 3A, $P(q)$ bifurcates for $s > s_c$, and all of the
weight becomes consolidated in the two sharp
peaks at $q = \pm 1$ above the transition. This is the
signature of an ordered phase, in which there is
a single thermodynamic state up to $Z_2$ symmetry
(28). For the SG example in Fig. 3B, $P(q)$
broadens in the vicinity of $s_c$ and then develops
multiple peaks that are approximately symmetric
around $q = 0$ and continuous throughout.
This is consistent with the existence of many
competing thermodynamic states, as typically
characterizes a disordered SG phase (28).
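
A minimal Python sketch of how an empirical overlap distribution could be built from a set of recorded states follows. It assumes the usual Edwards-Anderson overlap $q = (1/N) \sum_i s_i^A s_i^B$ and, as a simplification chosen here, forms $q$ from every distinct pair of states; the function names are hypothetical and the three short states are toy data.

```python
from collections import Counter
from itertools import combinations

def overlap(state_a, state_b):
    """q = (1/N) * sum_i s_i^A s_i^B for two equal-length +/-1 sequences."""
    return sum(a * b for a, b in zip(state_a, state_b)) / len(state_a)

def overlap_distribution(states):
    """Empirical P(q) over all distinct pairs of the supplied states."""
    counts = Counter(overlap(a, b) for a, b in combinations(states, 2))
    total = sum(counts.values())
    return {q: n / total for q, n in sorted(counts.items())}

# Toy ordered ensemble: an alternating state and its global spin flip.
states = [(1, -1, 1, -1), (-1, 1, -1, 1), (1, -1, 1, -1)]
p_of_q = overlap_distribution(states)
```

For this ordered toy ensemble all weight sits at $q = \pm 1$, the two-peak signature described above for the AFM phase.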

The final analysis presented here is the phase
diagram projected onto the $(p, \Gamma/\mathcal{J})$-plane for
the manifold depicted in Fig. 1A. To convert $s_c$
inferred from linear susceptibility peak positions
to $\Gamma/\mathcal{J}$, we used the lattice spin energy
scale versus $s$ mapping given in the supplementary
materials. The results of this mapping are
shown in Fig. 4A along with the phases identified
based on the order parameter analysis
presented above. For reference, we have also included
$p_c$ from the literature (15), as well as values
of $\Gamma_c^{\mathrm{AFM}}/\mathcal{J}$ (18) and $\Gamma_c^{\mathrm{SG}}/\mathcal{J}$ (19) for an infinitely
large system at $T = 0$. The experimentally determined
phase diagram projection yields a value
of $p_c$ at $T = 0$ that agrees with numerical results.
As of the time of writing, we had not established
an experimental method to resolve
any possible reentrant behavior of the AFM-SG
phase boundary in the $(p, \Gamma)$-plane, as is suspected
to exist in the $(p, T)$-plane (29). Critical
transverse fields are systematically lower than
the $L \to \infty$ results at $T = 0$ because we are
constrained to measure on a particular manifold,
as depicted in Fig. 1A. The intersection
between this experimentally accessible manifold
and the phase boundaries can be varied
by altering the annealing trajectory, which can
be achieved by adjusting the magnitude of the
interaction energy $\mathcal{J}$ by an overall scale factor
(with further details given in the supplementary
materials). For fixed $p$, one can measure the $\chi_{\mathrm{AFM}}$
peak position, similar to that for $p = 0$ depicted
in Fig. 2D, as a function of relative scaling of $\mathcal{J}$.
Doing so samples the PM-AFM phase boundary
in the $(\Gamma/\mathcal{J}, k_B T/\mathcal{J})$-plane. Results for $p = 0$
are shown in Fig. 4B. Decreasing or increasing $\mathcal{J}$
moves the phase boundary toward the classical
transition point $k_B T_c^{\mathrm{AFM}}/\mathcal{J}$ (16) or quantum
critical point $\Gamma_c^{\mathrm{AFM}}/\mathcal{J}$ (18), respectively. Fitting
the rightmost edge of the phase boundary to a
power law $k_B T/\mathcal{J} \propto |\Gamma/\mathcal{J} - \Gamma_c^{\mathrm{exp}}/\mathcal{J}|^{\phi_{\mathrm{exp}}}$ yielded
an experimental quantum critical point $\Gamma_c^{\mathrm{exp}}/\mathcal{J} =
5.1 \pm 0.3$ and exponent $\phi_{\mathrm{exp}} = 0.7 \pm 0.3$. We have
also included the theoretical results of Elliott
and Wood (30) for an infinite-sized cubic lattice
in Fig. 4B.
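
One simple way to fit a power law of this kind is to scan trial values of the critical point and, for each, fit a straight line to the data in log-log space. The sketch below illustrates that idea only; the fitting procedure actually used for Fig. 4B is not described in the text, the function name is hypothetical, and the data points are synthetic values generated from an assumed critical point of 5.1 and exponent 0.7.

```python
import math

def fit_power_law(x, y, xc_grid):
    """Fit y = A * |x - xc|**phi by scanning xc over a grid and doing a
    linear least-squares fit of log y against log |x - xc| at each value."""
    best = None
    v = [math.log(yi) for yi in y]
    for xc in xc_grid:
        if any(xi == xc for xi in x):          # log(0) is undefined
            continue
        u = [math.log(abs(xi - xc)) for xi in x]
        n = len(u)
        ubar, vbar = sum(u) / n, sum(v) / n
        suu = sum((ui - ubar) ** 2 for ui in u)
        suv = sum((ui - ubar) * (vi - vbar) for ui, vi in zip(u, v))
        phi = suv / suu
        log_a = vbar - phi * ubar
        sse = sum((vi - (log_a + phi * ui)) ** 2 for ui, vi in zip(u, v))
        if best is None or sse < best[0]:
            best = (sse, xc, phi, math.exp(log_a))
    return best[1], best[2], best[3]

# Synthetic boundary points (not measured data) on the assumed curve.
gammas = [3.0, 3.5, 4.0, 4.5, 4.8]
temps = [2.0 * (5.1 - g) ** 0.7 for g in gammas]
grid = [k / 100 for k in range(490, 531, 5)]   # trial critical points 4.90 to 5.30
xc_fit, phi_fit, a_fit = fit_power_law(gammas, temps, grid)
```

On real data the residual would vary smoothly with the trial critical point, and the width of that minimum gives a rough sense of the uncertainty on both fitted parameters.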

The experiments reported on here demon- 
strate how an in situ programmable TFIM in an 
integrated circuit can be used as a materials 
physics simulator. The ability to manipulate in- 
dividual spins and bonds allows one to explore 
order parameters in ways that are not possible 
in bulk condensed matter systems. This work 
opens many potential avenues of research with- 
in the field of quantum magnetism, with the 
possibility of studying lattices with built-in 
geometrical frustration (31, 32), lattice gauge 
theories (33), and defect/domain formation. 
GEODETIC MONITORING


The rise, collapse, and compaction of 
Mt. Mantap from the 3 September 2017 
North Korean nuclear test 


Teng Wang, Qibin Shi, Mehdi Nikkhoo, Shengji Wei, Sylvain Barbot,
Douglas Dreger, Roland Bürgmann, Mahdi Motagh, Qi-Fu Chen


Surveillance of clandestine nuclear tests relies on a global seismic network, but the 
potential of spaceborne monitoring has been underexploited. We used satellite radar 
imagery to determine the complete surface displacement field of up to 3.5 meters of 
divergent horizontal motion with 0.5 meters of subsidence associated with North Korea’s 
largest underground nuclear test. Combining insight from geodetic and seismological 
remote sensing, we found that the aftermath of the initial explosive deformation involved 
subsidence associated with subsurface collapse and aseismic compaction of the damaged 
rocks of the test site. The explosive yield from the nuclear detonation with best-fitting 
source parameters for 450-meter depth was 191 kilotonnes of TNT equivalent. Our results 
demonstrate the capability of spaceborne remote sensing to help characterize large
underground nuclear tests.


World peace benefits from adherence to
internationally negotiated nuclear test 
ban treaties whose signatories strive to 
promote the nonproliferation of nuclear 
weapons. In 2003, the Democratic Peo- 
ple’s Republic of Korea (North Korea) became 
the first country to withdraw from the 1968 
Treaty on the Non-Proliferation of Nuclear Weap- 
ons. North Korea has been conducting under- 
ground nuclear weapon tests with increasing 
intensity since 2006. On 3 September 2017, two 
seismic events separated by ~8.5 min were de- 
tected at North Korea’s Punggye-ri nuclear test 
site. Soon thereafter, North Korea’s state media 
reported the successful firing of a two-stage 
thermonuclear bomb test. The U.S. Geological 
Survey and the China Earthquake Networks 
Center determined a body-wave magnitude ($m_b$)
of 6.3 for the first event (NKNT 6), much larger 
than any of the five nuclear tests since 2006 
(NKNT 1-5). Shortly thereafter, the scientific 
community started to determine the location, 
focal mechanism, and yield of the explosion by 
means of seismic waveforms and satellite optical
imagery (1). Preliminary analysis revealed
a predominantly isotropic explosive source located
beneath Mount Mantap (1-3), which also
hosted NKNT 2-5 (Fig. 1). 
The source properties of previous North Korean 
underground nuclear tests have been extensively
studied using seismic waveforms (4-12), but
surface displacements associated with these ex- 
plosions are rarely reported. Remote sensing with 
synthetic aperture radar (SAR) is a powerful tech- 
nique for monitoring deformation of Earth’s 
surface (13, 14) but its contribution to character- 
izing nuclear tests has been limited. NKNT 4, 
conducted on 6 January 2016, has been studied 
using SAR interferometry, but the interpretation 
of interferometric phase is difficult because of 
the single imaging geometry (15). Tracking the 
amplitude features of the SAR images (so-called 
pixel-offset tracking) is better suited when the 
interferometric phase is decorrelated (16). More- 
over, pixel offsets can map displacement along 
the radar line-of-sight and satellite-flying (azimuth) 
directions. In contrast to offset tracking of optical 
images, the SAR range offset is sensitive to the 
vertical displacement because of the slant-range 
imaging geometry, allowing for derivation of 
three-dimensional (3D) displacements (17-20).
Here, we rely on detailed 3D displacements de- 
rived from submeter-resolution SAR images to- 
gether with seismic waveform data to reveal the 
complex processes that took place during and 
in the immediate aftermath of NKNT 6. 

We measured the surface displacements caused 
by NKNT 6 by cross-correlating high-resolution 
spotlight radar images acquired by the German 
TerraSAR-X satellite, with an azimuth resolution 
of 1.1 m and a slant-range resolution as fine as 
0.45 m (fig. S1 and table S1). The accuracy of the 
offset measurement is about one-tenth of the imaging
resolution (21). We combined the azimuth
and range offsets from two ascending and two 
descending tracks to calculate the total 3D surface 
displacements produced during and in the immediate
aftermath of the explosion on a 300 m ×
300 m grid (Fig. 1 and figs. S3 to S5) (22). The
horizontal motions of up to 3.5 m show a divergent
pattern at the top of Mount Mantap with
a central zone of subsidence of ~0.5 m. We decomposed
the 3D displacements into vertical
and horizontal directions along two topographic 
profiles across the top of Mount Mantap (Fig. 1). 
The along-profile displacements show that the 
horizontal displacement is generally larger where 
the topography is steeper (the west and south 
flanks). However, the direction of motion does 
not follow the slope of the terrain but is nearly 
horizontal. This indicates that although there is 
strong topographic control on the surface dis- 
placement field caused by the buried explosion, 
it does not resemble the slope-parallel motions 
expected from triggered landslides. Although 
optical imagery suggests isolated landslide de- 
posits at the 10- to 100-m scale (23), these appear 
to be debris flows localized in preexisting channels 
that could not produce the large-scale horizontal 
motions we observed. 
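
Combining range and azimuth offsets from several viewing geometries into a 3D displacement is, at its core, a small least-squares problem: each offset is the projection of the unknown (east, north, up) vector onto a known direction. The Python sketch below illustrates that principle only. It is not the authors' processing chain (described in their supplementary materials); the unweighted fit is a simplification, and the four direction vectors are made-up values, not TerraSAR-X geometry.

```python
import math

def solve_3x3(m, rhs):
    """Solve a 3x3 linear system by Gauss-Jordan elimination with pivoting."""
    aug = [row[:] + [r] for row, r in zip(m, rhs)]
    for col in range(3):
        pivot = max(range(col, 3), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(3):
            if r != col:
                f = aug[r][col] / aug[col][col]
                aug[r] = [x - f * y for x, y in zip(aug[r], aug[col])]
    return [aug[i][3] / aug[i][i] for i in range(3)]

def displacement_3d(unit_vectors, offsets):
    """Unweighted least-squares (east, north, up) displacement.

    unit_vectors: one (e, n, u) direction per measurement.
    offsets: measured displacement along each direction, in metres.
    """
    ata = [[sum(v[i] * v[j] for v in unit_vectors) for j in range(3)]
           for i in range(3)]
    atb = [sum(v[i] * d for v, d in zip(unit_vectors, offsets))
           for i in range(3)]
    return solve_3x3(ata, atb)

def unit(v):
    norm = math.sqrt(sum(x * x for x in v))
    return tuple(x / norm for x in v)

# Hypothetical directions: range and azimuth for one ascending and one
# descending track (illustrative numbers only).
directions = [unit((-0.6, -0.1, 0.8)), unit((-0.17, 0.98, 0.0)),
              unit((0.6, -0.1, 0.8)), unit((0.17, 0.98, 0.0))]
truth = (1.0, -2.0, -0.5)
offsets = [sum(d * t for d, t in zip(v, truth)) for v in directions]
recovered = displacement_3d(directions, offsets)
```

With four independent directions the system is overdetermined, so noisy offsets would be averaged rather than fitted exactly; weighting each row by its measurement accuracy would be a natural refinement.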

To resolve the horizontal location and depth 
of the detonation chamber, we set up numerical 
models that predict the surface displacements 
attributable to the expansion and subsequent 
collapse of an underground cavity embedded in 
a uniform elastic crust below realistic surface 
topography (24, 25) (Fig. 2). We constrained the 
3D location of the source by minimizing the 
misfit between predicted and observed surface 
displacements (26, 27). The explosion and im- 
mediate collapse of a spherical cavity of 300 m 
radius that includes the detonation chamber 
and the surrounding damaged material repro- 
duced the horizontal displacement well, but 
this was not sufficient to explain the small ver- 
tical motion around Mount Mantap. A third, 
mostly aseismic process involving the compac- 
tion of a larger volume is invoked to explain 
the low uplift (Fig. 2A and fig. S6). A similar 
compaction process has been inferred from the 
seismic analysis of other explosions (28) and 
was observed in the weeks to months following 
underground nuclear tests conducted in Nevada 
(29). As we do not have any constraints on the 
geometry of the compaction zone, we assumed 
a generalized ellipsoidal geometry for it and 
inferred its dimensions by using the geodetic 
observations. We estimated the explosive source 
to be located at 129.078°E, 41.300°N ± 50 m,
1750 ± 100 m above mean sea level (i.e., 450 ±
100 m below the top of Mount Mantap). Incorporating
the large-scale compaction source into
the model does not influence our inferred epicenter 
of the explosion/collapse source much (22). 

Assuming that the hypocenter of the first 
event coincides with the center of the spherical 
cavity, we refined the relative location for the 
second seismic event using local seismic wave- 
form records from the NorthEast China Seismic 
Array to Investigate Deep Subduction (NECsaids) 
(30) and regional data from South Korean sites 
archived at the Incorporated Research Institu- 
tions for Seismology (IRIS) (Fig. 3A) (22). With 
the calibration from the first event and careful 
P-wave arrival picks of the second event (figs. 
S7 and S8), our grid search showed that the sec- 
ond event occurred 8 min 31.79 s after the first 
event and was located about 700 m to the south. 
Fig. 1. Three-dimensional displacement associated with the

3 September 2017 North Korea nuclear test (NKNT 6). (A) 3D 
displacements derived from radar imagery. Arrows indicate horizontal 
displacement; color indicates vertical motions spanning the explosion 
and ~1 week of additional deformation. The uncertainties are shown in 

fig. S4 and provided in data S1 with the displacements. The black outline 
derived from ALOS-2 coherence loss indicates the substantial surface 
disturbance and large displacement gradients caused by the explosion over 
an area of ~9 km² (figs. S1 and S2). Thin gray lines are topographic
contours at 100-m intervals. The red square in the upper right inset shows 


Because of the azimuthal gap in the station coverage,
the east-west location (±700 m) is less well
constrained relative to the north-south separation
(±200 m with 96% confidence) (Fig. 3C and fig.
S9). The refined location of the second implosive
event is beneath the area of large subsidence and 
southward horizontal motion under the south flank 
of Mount Mantap, between the initial explosion 
and the south portal of the tunnel system (Fig. 1). 

We applied the generalized cut-and-paste 
(gCAP) method (31) to the regional and local
waveform data to invert for the full moment 
tensor solutions of the two seismic events, including
an isotropic component (i.e., explosive or implosive
volume source), a compensated linear
vector dipole (CLVD) component (i.e., ring faulting
along a certain axis, such as a collapse), and a
double-couple component (i.e., shear dislocation
on a planar fault) (22). Our preferred solution
of the first event indicates a moment of 9.5 ×
$10^{16}$ N·m (moment magnitude $M_w = 5.24$), a 50 to
90% positive isotropic component, and relatively
small CLVD or double-couple contributions
(figs. S10 to S13). The second seismic event ($M_w$ =




the location of Mount Mantap (DPRK, Democratic People’s Republic of 
Korea; ROK, Republic of Korea). Red stars indicate the locations of 
NKNT 1-5 (1, 6, 9, 15, 37), among which NKNT 2-5 were all located 
within the NKNT 6 low-coherence region; NKNT 1 on 9 October 2006 was 
in a different location (5). Beach balls show locations and focal 
mechanisms of the $M_w$ 5.24 and $M_w$ 4.47 events on 3 September 2017.
(B and C) 2D (horizontal along the profile and vertical) 

displacements along two profiles across the top of Mount Mantap 
from north to south and from west to east, respectively. No vertical 
exaggeration in (B) or (C). 


4.5) has a large negative isotropic component (~50 
to 70% of the total moment) (figs. S14 and S15). 
Although we obtained a high waveform cross- 
correlation coefficient between the data and 
synthetics for most of the waveform components 
of the first event (e.g., Fig. 3B), the noise level for 
the second event is larger, resulting in a much 
smaller variance reduction of the observations 
(fig. S16). To overcome this limitation of the 
data, we sought more information about the 
moment tensor of the second event by directly 
comparing the waveforms with those of the 
Fig. 2. Model geometry and fit to the observed surface displacements.
(A) Perspective view of the model with topography and variance
reductions as a function of centroid position (both cross sections are 
centered on the best-fit location). We represent the first event, combining 
the explosion and immediate collapse, using a sphere of 300 m radius with 
a centroid located at a depth about 450 m below Mount Mantap. We model 
the aseismic subsidence detected with geodetic data about a week after 
the seismic events 1 and 2 with an ellipsoid of dimension 800 m × 800 m ×
470 m (semi-axes), centered 100 m deeper than the explosive source.
The isotropic components of the moment tensors are represented as 
beach balls. (B) The interferometric SAR (InSAR) observed and simulated 
surface displacements. (C) The west-east and south-north profiles of the 
surface displacements from the SAR observations and the best-fitting 
models. The dashed profiles represent the contributions of the explosion/ 
collapse (event 1) and the subsequent aseismic compaction on the 
surface displacement. We ignore the deformation caused by event 2. 
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first event. We multiplied the amplitude of the 
vertical-component waveforms of the second 
event by a factor of -60 and compared them 
with the waveforms of the first event at higher 
frequencies (~0.2 to 0.9 Hz). The result (Fig. 3D 
and fig. S16) shows very high waveform cross- 
correlation coefficients, even for some coda waves, 
supporting the close locations but opposite 
isotropic polarities of the two events (2). 
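
The polarity comparison described above can be sketched as a zero-lag normalized cross-correlation. This is a minimal illustration on synthetic samples, not the authors' processing chain: the function name `normalized_cc` and the toy waveforms are assumptions, and real records would first be band-pass filtered (~0.2 to 0.9 Hz) and time-aligned.

```python
import math

def normalized_cc(a, b):
    """Zero-lag normalized cross-correlation of two equal-length traces."""
    if len(a) != len(b):
        raise ValueError("traces must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a) * sum(y * y for y in b))
    return dot / norm if norm else 0.0

# Hypothetical waveforms: event 2 is a polarity-flipped copy of event 1
# that is 60 times smaller in amplitude.
event1 = [math.sin(0.3 * i) * math.exp(-0.02 * i) for i in range(200)]
event2 = [-x / 60.0 for x in event1]

# Multiply event 2 by -60, as in the text, before comparing.
scaled = [-60.0 * x for x in event2]

cc_raw = normalized_cc(event1, event2)     # opposite polarity, close to -1
cc_scaled = normalized_cc(event1, scaled)  # polarity restored, close to +1
```

Because the coefficient is normalized, only the sign of the scaling factor changes its value; the magnitude of 60 matters when the two traces are overlaid for visual comparison.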
Combining the depth constraints from geodesy 
and energy constraints from seismology, we can 
refine the explosive yield of the nuclear explosion 
(32-34). We assumed the seismic velocity model 
MDJ2 (4) for the elastic Earth structure. We based
the overburden pressure on the best-fitting centroid
source depth of 450 ± 100 m from the geodetic
modeling. The medium in which the device was
detonated was likely the granodiorite
that lies beneath the stratified volcanic rocks that 
make up the high elevations of Mount Mantap 
(8). We assumed a gas porosity of 1% for granitic 
rocks (35). Considering an isotropic seismic 
Mw = 5.05 (the mean value of solutions fitting 
within 95% of the maximum fit for a source 
depth of 450 m) and the possible range of source 


depths of 350 to 550 m, the yield estimates range 
between 171 and 209 kt of TNT equivalent, with 
191 kt corresponding to the best-fitting source 
parameters from geodetic and seismic data (Fig. 
3E). Doubling the gas porosity results in an 8% 
increase in the magnitude of the estimated 
yield. 

The source characteristics we derived from 
surface displacement and seismic waveforms 
are in remarkable agreement. The divergent 
horizontal motions and the moment tensor of 
the first event consistently suggest a predomi- 
nant isotropic explosive source buried at shal- 
low depth. The moment of the geodesy-derived 
models, assuming an empirical rigidity of 5.7 GPa 
(36), is Mw = 5.5, larger than the one inferred
seismically (Mw = 5.24), because it includes slow
deformation that did not generate seismic waves,
with a total volume change of 0.01 km³. The seismic
analysis of the second event reveals an implosive
seismic source that occurred south of the
first event with a dominant negative isotropic 
component, suggesting an inverse process of the 
main explosion. This may reflect the combination 
of negative isotropic compaction of the over- 


pressured cavity and/or vertical collapse of the 
explosion chimney and nearby tunnel segments 
due to gravity, contributing to the subsidence on 
the south flank of Mount Mantap (Fig. 4). The 
larger-scale compaction source in the geodetic 
model is independent of the first and second 
seismic events, and the post-explosion compac- 
tion of surrounding rocks may continue aseis- 
mically for an extended period, as seen in the 
Nevada underground nuclear test site with initial 
subsidence rates of ~1 to 7 cm/year (29). 
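
To give a sense of scale for the difference between the geodetic (Mw = 5.5) and seismic (Mw = 5.24) magnitudes quoted above, the sketch below converts both to scalar moment. It assumes the standard Hanks-Kanamori relation, Mw = (2/3)(log10 M0 - 9.1) with M0 in N·m; the text does not state which convention was used, so the function names and the relation itself are illustrative assumptions.

```python
import math

def moment_from_mw(mw):
    """Scalar moment in N*m from moment magnitude (Hanks-Kanamori form)."""
    return 10 ** (1.5 * mw + 9.1)

def mw_from_moment(m0):
    """Moment magnitude from scalar moment in N*m."""
    return (2.0 / 3.0) * (math.log10(m0) - 9.1)

m0_geodetic = moment_from_mw(5.5)   # geodesy-derived magnitude from the text
m0_seismic = moment_from_mw(5.24)   # seismically inferred magnitude from the text

# Ratio of moments; under this relation a 0.26-unit magnitude gap
# corresponds to a factor of 10 ** (1.5 * 0.26).
ratio = m0_geodetic / m0_seismic
print(f"geodetic/seismic moment ratio: {ratio:.2f}")
```

Under this assumption the geodetic moment is roughly two and a half times the seismic one, consistent with the statement that the geodetic estimate also captures slow deformation that radiated no seismic waves.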

Our 3D surface displacement measurements 
and elastic modeling incorporating realistic 
topography allow for locating the main explosive 
event within ±50 m, assuming a uniform elastic 
medium, although a nonuniform structure and 
small-scale surficial processes (e.g., landslides) 
may bias our results. Because the largest defor- 
mation occurred above the explosive source as a 
result of the chimneying and spalling effect (28), 
the centroid of the modeled geodetic source may 
be located above the actual detonation point. 
After the explosion, water was observed to be 
flowing from the tunnel portal (23). Assuming a 
slope of 2° to 4° to provide drainage, the depth
implied from the elevation of the tunnel entrance
is about 600 to 700 m below the surface,
consistent with a detonation point about 150 m
deeper than the centroid of the geodetic model.
Combining the available space-borne geodetic
and seismic records provided new insights into
the mechanics of deformation surrounding North
Korea’s sixth underground nuclear test, revealing
the explosion, collapse, and subsequent compaction
sequence (Fig. 4). The modeling of the
geodetic observations reduces the epicentral and
depth uncertainties that otherwise hinder the analysis
of seismic waveforms. The derived horizontal
location of the first event is important to relatively
relocate the second event, which likely indicates
the collapse of the tunnel system of the test site.
The inclusion of geodetic data also helps to resolve
the aseismic deformation processes that may
follow nuclear tests. Finally, our findings demonstrate
the capability of monitoring shallow
underground nuclear tests by means of remote-sensing
observations and seismic sensors.

Fig. 3. Analysis of seismic waves. (A) Station map of broadband
seismometers with four stations in (B) and (D) highlighted in red. The
black and red stars are the epicenter locations of the first and second event,
respectively. (B) Moment tensor solutions for the first explosive event
(left) and the second implosive event (right), with the vertical component
of two representative stations shown at the bottom. Both data and synthetics
are filtered between 0.02 and 0.045 Hz. Station names are shown at the
beginning of waveform pairs; distance (in kilometers) and azimuth (in
degrees) are indicated below. (C) Grid search result (under L1 norm) for
relocating the second event relative to the first event (black star). Marginal
distributions for the epicentral position are plotted along the northing
and easting axes. (D) Vertical component waveform comparison between
the first (black) and the second (red) event at two representative stations,
with the second event waveforms multiplied by -60. (E) Explosive yield in
the context of historical nuclear tests. The black dots and error bars show
yields estimated according to the mean and standard deviation of tabulated
moment within 95% of the best-fitting solutions with depths of 300 m,
450 m, and 600 m, respectively.

[Fig. 4 panel labels: (A) Explosion, surface uplift; (C) Second seismic source.]
Fig. 4. Summary deformation scenario for the 3 September 2017 North
Korea nuclear test. (A, B, and D) The unfolding of events includes the
succession of explosive (A), collapse (B), and compaction (D) processes, with
different associated surface displacements. (C) The implosive source may be
shallow and may only contribute localized surface displacements. The radar imagery
reveals the deformation [arrows in (D)] resulting from the three processes.

Deconstructive fluorination of cyclic
amines by carbon-carbon cleavage


Jose B. Roque*, Yusuke Kuroda*, Lucas T. Göttemann, Richmond Sarpong†


Deconstructive functionalizations involving scission of carbon-carbon double bonds are well
established. In contrast, unstrained C(sp³)-C(sp³) bond cleavage and functionalization
have less precedent. Here we report the use of deconstructive fluorination to access mono-
and difluorinated amine derivatives by C(sp³)-C(sp³) bond cleavage in saturated nitrogen
heterocycles such as piperidines and pyrrolidines. Silver-mediated ring-opening fluorination
using Selectfluor highlights a strategy for cyclic amine functionalization and late-stage
skeletal diversification, establishing cyclic amines as synthons for amino alkyl radicals and
providing synthetic routes to valuable building blocks.


The ubiquity of carbon-carbon (C-C) bonds 
in organic compounds places a premium 
on methods that construct such bonds. In 
general, these bond constructions lead to 
an increase in structural complexity. How- 
ever, in certain cases, the cleavage of C-C bonds 
may lead to more synthetically complex products 
that cannot be prepared efficiently in any other 
way. Historically, the full benefit of C-C bond 
cleavage (deconstructive strategies) has often 
been realized by coupling this process with 
value-added bond constructions such as C-C 
bond formation (e.g., in olefin metathesis pro- 
cesses) (1, 2) or C-O bond formation (e.g., in 
ozonolysis) (3) (Fig. 1A). Whereas the benefits of 
these C(sp²)=C(sp²) double-bond cleavage and 
functionalization processes are well established, 
the value of deconstructive processes becomes 
even more apparent when C(sp³)-C(sp³) single-bond
cleavage and functionalization are considered,
especially in the context of late-stage
skeletal diversification to access unexplored 
chemical space (4) (Fig. 1B). The development 
of deconstructive functionalizations of cyclic 
amines would be particularly useful, given their 
ubiquity in pharmaceuticals and agrochemicals 
(5, 6). However, methods for the ring-opening 
of cyclic amines remain extremely limited and 
are dominated by C-O bond formation through 
α-oxidation followed by heterolytic C-N bond 
cleavage via the well-established equilibrium 
with the hemiaminal that forms (7-12). Although 
variants of the von Braun reaction of nucleo- 
philic tertiary N-alkyl-substituted cyclic amines 
lead to C-Cl bond formation via heterolytic ring- 
opening, this strategy is limited to cyclic amines 
with small ring sizes owing to competing N-dealkylation
(13). With these limitations in mind,
we sought a mechanistically distinct strategy that 
would provide a general entry to the deconstruc- 
tive functionalization of cyclic amines. 
In this context, reactions that form C(sp³)-F
bonds are among the most valued bond constructions
because of the influence of fluorine substitution
on the properties of pharmaceuticals,
agrochemicals, and organic materials (14-17). For
example, installation of fluorine may lead to in- 
creased metabolic stability, altered physicochem- 
ical properties such as increased lipophilicity, 
reduced basicity of nearby nitrogen atoms, and 
conformational tuning. As a consequence, sub- 
stantial progress has been made on site-selective 
reactions that form C(sp³)-F bonds (18, 19). 
Nevertheless, the development of methods for 
C(sp³)-F bond formation that facilitate the preparation
of a variety of fluorine-containing building
blocks from easily available starting materials
remains a prominent goal. Here we report a de- 
constructive strategy to transform cyclic amine 
derivatives into versatile fluorine-containing 
acyclic amine derivatives, using commercially 
available reagents, through C(sp³)-C(sp³) single-bond
cleavage followed by C(sp³)-F bond formation
(Fig. 1C).

Our strategy for deconstructive fluorination of 
cyclic amine derivatives is based on two discrete
stages, each mediated by a silver salt (Fig. 1C). In
the first stage, we envisioned that under an ap- 
propriate set of oxidative conditions, a saturated 
cyclic amine would be oxidized to the correspond- 
ing iminium ion A, which would be trapped by 
H2O to form hemiaminal B. In the second stage, 
the resulting hemiaminal B could undergo homo- 
lytic ring-opening upon engaging the silver salt 
to yield primary radical C (20). A subsequent 
fluorine atom transfer would deliver the desired 
fluorinated product. On this basis, cyclic amines 
could be viewed as synthons for amino alkyl 
radicals, which have conventionally only been 
shown to arise from the corresponding halide, 
alcohol, or carboxylic acid derivatives (21). Although
this strategy is conceptually simple, several
challenges were inherent in putting it into
practice. First, only a few methods exist for oxida- 
tion of amines to the corresponding hemiaminals 
owing to the competing over-oxidation to amides 
(22). Furthermore, no reports exist of α-oxidation 
of cyclic amine derivatives using silver salts. How- 
ever, given the oxidation potential of N-protected 
cyclic amines such as 1a [anodic peak potential = 
+1.13 V versus saturated calomel electrode (SCE)] 
(fig. S1), we theorized that Ag(II) salts could be 
sufficiently oxidizing [standard reduction potential 
(Ag²⁺/Ag⁺) = +1.98 V versus SCE] (23) to effect 
single-electron transfer. Second, most of the re- 
ports of successful ring-opening fluorination are 
limited to strained tertiary cycloalkanols such as 
cyclobutanols (24-27); only limited examples that 
feature relatively unstrained cycloalkanols such 
as cyclopentanols and cyclohexanols are known, 
and these cases resulted in low yields (25). The 
challenge in achieving our envisioned transfor- 
mation rested on identifying a silver salt and 
fluorinating reagent combination that would 
act in synergy to selectively cleave and function- 
alize the desired C-C bond. 
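
The feasibility argument above, that Ag(II) should be able to oxidize 1a, can be put in rough numbers with ΔG = -nFΔE. The sketch below is only an estimate under stated assumptions: it treats the anodic peak potential of 1a as if it were a reversible potential, which it need not be, and the function name is hypothetical.

```python
FARADAY = 96485.332  # Faraday constant, C/mol

def delta_g_kj_per_mol(e_oxidant_v, e_substrate_v, n_electrons=1):
    """Free-energy change (kJ/mol) for electron transfer, dG = -n F dE."""
    delta_e = e_oxidant_v - e_substrate_v
    return -n_electrons * FARADAY * delta_e / 1000.0

e_ag = 1.98  # Ag(II)/Ag(I), V versus SCE, value quoted in the text
e_1a = 1.13  # anodic peak potential of 1a, V versus SCE, value quoted in the text

dg = delta_g_kj_per_mol(e_ag, e_1a)
print(f"dE = {e_ag - e_1a:.2f} V, dG = {dg:.0f} kJ/mol")
```

The 0.85 V gap corresponds to a driving force of roughly 80 kJ/mol for single-electron transfer, which is the sense in which Ag(II) is described as sufficiently oxidizing.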

[Fig. 1 scheme; recoverable panel labels: C(sp³)-C(sp³) bond cleavage; not readily available.]

(D) Standard conditions: N-Bz piperidine 1a, Selectfluor (4 equiv), AgBF4 (4 equiv), acetone:H2O (1:9), 40°C, 1 h, giving 2a.

entry   variation from the standard conditions   yield (%)*
1       none                                     [illegible]†
2       AgNO3 instead of AgBF4                   42
3       no [Ag]                                  0
4       NFSI instead of Selectfluor              0
5       MeCN instead of acetone                  51
6       AgBF4 (50 mol%)                          52

*Yield by 1H NMR integration using Ph3CH as an internal standard.
†Isolated yield.

Fig. 1. Development of a deconstructive fluorination of cyclic amines. (A) Well-established
deconstructive functionalization. (B) An elusive deconstructive functionalization. (C) A blueprint for
deconstructive fluorination of cyclic amines. (D) Optimization of silver-mediated deconstructive fluorination
of N-Bz piperidine 1a. R, any functional group; Me, methyl; Ph, phenyl; equiv, equivalents; h, hour.

We began our investigation by establishing the
conditions for the overall transformation using
N-Bz piperidine 1a as the substrate (Bz, benzoyl)
(Fig. 1D). After extensive screening of various
conditions, we identified the optimized conditions
shown in entry 1, which use cheap and
commercially available AgBF4 in a 9:1 (v/v) mixture
of H2O/acetone at 40°C. Other silver sources
led to lower yields, with AgNO3 providing the
highest yield among them (entry 2). A control 
experiment established that a silver salt is es- 
sential to obtaining the desired fluorinated pro- 
duct (entry 3). Other fluorinating reagents such 
as N-fluorobenzenesulfonimide (NFSI) led to 
no reaction (entry 4). A 9:1 (v/v) mixture of H2O/
MeCN gave a diminished yield (51%) of 2a (entry 5),
pointing to the superiority of acetone as the
cosolvent. The overall transformation can be conducted
with substoichiometric amounts of AgBF4
to provide 2a, albeit in modest yield (entry 6).
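
As a practical illustration of the standard conditions (4 equivalents each of AgBF4 and Selectfluor), the sketch below converts equivalents to reagent masses. The 0.1 mmol scale is taken from the Fig. 2 caption; the molar masses are computed from standard atomic weights, and the function name is hypothetical.

```python
# Molar masses in g/mol, computed from standard atomic weights.
MOLAR_MASS = {
    "AgBF4": 194.67,
    "Selectfluor": 354.26,  # C7H14B2ClF9N2
}

def reagent_mass_mg(substrate_mmol, equivalents, molar_mass):
    """Mass in mg for a given number of equivalents relative to the substrate.

    mmol multiplied by g/mol gives mg directly.
    """
    return substrate_mmol * equivalents * molar_mass

scale_mmol = 0.1  # substrate scale quoted in the Fig. 2 caption
masses = {
    name: reagent_mass_mg(scale_mmol, 4, mm) for name, mm in MOLAR_MASS.items()
}
for name, mg in masses.items():
    print(f"{name}: {mg:.1f} mg")
```

At this scale the reaction calls for roughly 78 mg of AgBF4 and 142 mg of Selectfluor.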
With the optimized conditions established, we 
proceeded to investigate the scope of the de- 
constructive fluorination process. As shown in 
Fig. 2, several structurally and electronically dis- 
tinct N-substituted piperidine derivatives were 
fluorinated effectively, including those bearing 
acetyl (Ac, 1b), tert-butoxycarbonyl (Boc, 1c), and 
pivaloyl (Piv, 1d) groups. The deconstructive 
fluorination method is not limited to piperidine 
derivatives; a range of N-benzoylated saturated 
azacycles including azetidine 1e, pyrrolidine 1f,
azepane 1g, and azocane 1h were all viable in the
deconstructive fluorination reaction. Fluorinated 
products 2e and 2f were obtained in the de- 
formylated form (vide infra). The variation of the 
cyclic amine substrate ring size led to fluoro- 
amine derivatives bearing carbon chains of vary- 
ing lengths. A variety of substitution patterns on 
the piperidine ring were also well tolerated, and 
the corresponding acyclic fluorinated amines were 
obtained in moderate to good yields (50 to 85%). 
Some of the fluorinated alkyl amine products have 
not been previously reported and would not be 
readily accessible by conventional deoxyfluorina- 
tion strategies owing to the limited availability 
of the corresponding substituted linear amino 
alcohols. For example, 2-substituted piperidines 
1j and 1k afforded the corresponding fluoroamines 
2j and 2k, respectively, with complete positional 
selectivity. The observed selectivity for cleavage 
away from the substituents may be attributed to 
the steric hindrance imparted by these groups at 
the α-position of the cyclic amines. 3-Substituted 
piperidines were also good substrates, as evidenced 
by 1l undergoing ring-opening and fluorination
to provide 2l in 50% yield, demonstrating that
secondary alkyl fluorides can be accessed by this 
method. Fused piperidines such as 1m underwent 
deconstructive fluorination to provide 2m in 43% 
yield. This example demonstrates that polycyclic 
molecules can be functionalized as well, paving 
the way for late-stage skeletal diversification of 
complex molecules. Moreover, L-pipecolic acid 
derivative 1n gave 5-fluoro-L-norvaline derivative 
2n in 68% yield (three steps from L-pipecolic acid), 
considerably shortening the synthesis of 5-fluoro- 
L-norvaline (previously prepared in seven steps 
from L-glutamic acid) (28). L-proline methyl ester 
derivative 1o was converted to pyrrolidinone derivative
2o, presumably by over-oxidation of an
intermediate 5-hydroxyproline derivative.

Fig. 2. Deconstructive fluorination: cyclic amine scope. Only isolated yields are shown. Reaction
conditions: 1 (0.1 mmol), AgBF4 (4 equivalents), Selectfluor (4 equivalents), acetone:H2O (1:9),
40°C, 1 hour. *Deformylated product obtained. dr, diastereomeric ratio.

Similarly,
N-methyl-2-piperidinone (1p) and N-methyl-
2-pyrrolidinone (1q) were oxygenated under the 
reaction conditions to give N-methyl imides 2p 
and 2q, respectively. These oxygenation reac- 
tions are important in their own right because 
methods for the direct α-oxygenation of cyclic
amides are dominated by strongly oxidizing RuO4
(22). N-Piv-2-pyrrolidinone 1r afforded fluorinated
product 2r under the same conditions. Moreover, 
piperidines containing carboxylic acid groups 
underwent dual functionalization to provide di- 
fluorinated amines through decarboxylative (29) 
and deconstructive fluorination. For example, 
N-Bz piperidines 1s and 1t underwent dual fluorination
to provide 3,5-difluorinated amine 2s
(28%) and 4,5-difluorinated amine 2t (22%), along 
with N-Bz 3-fluoropiperidine (28%) and N-Bz 
4-fluoropiperidine (20%), respectively. 

As a demonstration of the utility of this method, 
we considered functionalizing synthetic peptides, 
which continue to see widespread use in drug 
discovery (30-32). Deconstructive functional- 
ization of peptides can provide orthogonal and 
complementary skeletal diversification and add 
profitably to the toolbox of available methods 
(33). When dipeptide 3a, which has a valine 
residue, was subjected to our reaction conditions, 
the fluorination proceeded readily to afford fluo- 
rinated dipeptide 4a in 50% yield along with 


25% recovered starting material (Fig. 3). In this 
case, prolonged reaction times did not lead to an 
increase in the product yield, presumably owing 
to product inhibition. Internal peptides such as 
3b also underwent deconstructive fluorination 
to provide 4b in 38% yield (accompanied by 
40% of recovered starting material). Likewise, 
tripeptide 3c was converted to 4c in 39% yield, 
along with 29% of recovered starting material. 
This methodology could also be applied to the 
selective C5-oxygenation of L-proline-containing 
peptides. For example, peptide 3d was oxygenated 
in 76% yield to provide 4d. N-Benzoyl imide- 
containing peptide 4d served as a versatile in- 
termediate for further functionalization. We did 
not observe racemization of the fluorinated and 
oxygenated peptides, which were obtained as 
single diastereomers. 

A number of additional experiments were per- 
formed to elucidate the reaction mechanism. We 
began by using nuclear magnetic resonance (NMR) 
spectroscopy to investigate the interaction of 
Ag(I) and Selectfluor. Unexpectedly, a ¹⁹F NMR
spectrum of an equimolar mixture of Selectfluor
and AgBF4 in a 1:9 (v/v) mixture of acetone-d6/
D2O, acquired after stirring at 40°C for 1 hour
(fig. S2), displayed no consumption of Selectfluor
(34, 35). However, in the presence of an equivalent
of 1a, consumption of Selectfluor was observed,
suggesting that the N-protected cyclic
amine substrates play an important role in initiating
the ring-opening and fluorination process.
In addition, line broadening in the ¹H NMR
spectrum was observed, suggesting the formation
of a paramagnetic Ag(II) complex. Furthermore,
downfield shifts of NMR resonances of 1a
were observed in the ¹H NMR spectra upon addition
of AgBF4 (fig. S3), suggesting the binding
of Ag(I) to the amide moiety of 1a (36, 37). On
the basis of these NMR experiments, we propose 
a mechanism that involves initial coordination of 
Ag(I) to 1a, followed by single-electron oxidation 
by Selectfluor to form Ag(II) and radical dication 
5 (Eq. 1 in Fig. 4A) (35). The resulting Ag(II) 
species would undergo single-electron transfer 
from 1a. Subsequent hydrogen-atom abstraction 
by 5 (38) would deliver iminium ion A, and this 
would be followed by trapping by H2O to give 
hemiaminal B (Eq. 2 in Fig. 4A). An alternative 
mechanistic pathway wherein radical dication 5 
undergoes α-amino C-H abstraction of 1a followed
by single-electron transfer by Ag(II) to
generate the same iminium ion A cannot be 
ruled out. From hemiaminal B, an alkoxy Ag(II) 
intermediate may form (not shown). Opening of 
this intermediate to primary radical C would 
achieve the desired C(sp³)-C(sp³) bond cleavage,
and attendant fluorination of the radical by
Selectfluor would yield 2a (path A in Fig. 4B). 
However, we recognize that another pathway 
could be operable based on the fact that defor- 
mylated products were obtained in some cases. In 
this alternate pathway, opening of the hemiaminal 
to linear aldehyde D and subsequent oxidation 
to the corresponding carboxylic acid would then 
set the stage for a decarboxylative fluorination,
in line with the precedent of (29) (path B in Fig.
4B). In a series of experiments to support or refute
either mechanism (Fig. 4C), aldehyde D, which 
likely exists in equilibrium with hemiaminal B, 
was subjected to our reaction conditions and 
gave fluoroamine 2a in 55% yield, which can 
only be accessed through path A. In addition, 
when the reaction was conducted over a pro- 
longed period, the benzoyl amide was obtained 
as the major product, indicating that the con- 
version of 2a to 6 likely occurs through a de- 
formylation process. However, the successful 
fluorination of N-phthaloyl aldehyde 7 demon- 
strates that fluorination can proceed from the 
aldehyde, which cannot form the hemiaminal. 
That 7 gave a higher yield than 1a and D is a 
result of the relative stability of the product 8 
under the oxidative reaction conditions that 
were used. Subjecting carboxylic acid 9 to the 
optimized conditions resulted in 23% yield of 
fluorinated product 6. On the basis of these 
experiments, we cannot rule out the possibility 
that path B is operative for the small subset of 
substrates that gave exclusively deformylated 
products. Lastly, on the basis of our mechanistic 
proposal, we sought to explore the reactivity of 
enamides under our optimized reaction condi- 
tions. We envisioned enamide 10 undergoing 
electrophilic fluorination followed by trapping 
of the resulting carbocation with water to yield 
11 (39). An alkoxy Ag(II) intermediate would
follow, leading to C(sp³)-C(sp³) cleavage and
fluorination to yield gem-difluorinated protected 
amine 12. As shown in Fig. 4D, in a reaction using 
catalytic amounts of silver salts, a variety of 
enamides related to 10 underwent the desired 
deconstructive difluorination to yield 12a to 12c 
in 54 to 76% yield. In support of path A (Fig. 4A), 
the formyl imide products were isolated as the 
major product under the optimized reaction 
conditions. These results are productive given 


the established importance of difluoromethyl 
groups (40, 41). For example, the difluoromethyl 
moiety serves as a lipophilic hydrogen bond 
donor that acts as a bioisostere for thiol and 
hydroxyl groups. 

The simple protocol of the deconstructive fluo- 
rination, which proceeds in aqueous solvent mix- 
tures as well as in water alone (42, 43), should 
lead to its widespread adoption for late-stage 
skeletal diversification in the pharmaceutical
and agrochemical arena. From the retrosynthetic
viewpoint, cyclic amines can now be regarded as
synthons for amino alkyl radical intermediates,
which can be engaged by a variety of coupling
partners. Thus, we anticipate that this method
will unlock fundamentally different disconnection
strategies.

EARLY OCEAN


Late inception of a resiliently 
oxygenated upper ocean 


Wanyi Lu, Andy Ridgwell, Ellen Thomas, Dalton S. Hardisty, Genming Luo,
Thomas J. Algeo, Matthew R. Saltzman, Benjamin C. Gill, Yanan Shen,
Hong-Fei Ling, Cole T. Edwards, Michael T. Whalen, Xiaoli Zhou,
Kristina M. Gutchess, Li Jin, Rosalind E. M. Rickaby, Hugh C. Jenkyns,
Timothy W. Lyons, Timothy M. Lenton, Lee R. Kump, Zunli Lu*


Rising oceanic and atmospheric oxygen levels through time have been crucial to 
enhanced habitability of surface Earth environments. Few redox proxies can track secular 
variations in dissolved oxygen concentrations around threshold levels for metazoan 
survival in the upper ocean. We present an extensive compilation of iodine-to-calcium 
ratios (I/Ca) in marine carbonates. Our record supports a major rise in the partial 
pressure of oxygen in the atmosphere at ~400 million years (Ma) ago and reveals a step 
change in the oxygenation of the upper ocean to relatively sustainable near-modern 
conditions at ~200 Ma ago. An Earth system model demonstrates that a shift in organic 
matter remineralization to greater depths, which may have been due to increasing 
size and biomineralization of eukaryotic plankton, likely drove the I/Ca signals at
~200 Ma ago.


The evolution and survival of marine animals
depended on oxygen availability, particularly
in upper ocean waters, ranging
from the sea surface to the thermocline,
during early Earth history (1). The dissolved
oxygen concentration ([O2]) in the upper ocean
commonly decreases from the well-mixed surface
ocean (the top few tens of meters) into deeper
subsurface waters (a few hundred meters). This
[O2] gradient is controlled by three key factors:
(i) the partial pressure of oxygen in the atmosphere
(PO2), (ii) the intensity of upper-ocean
mixing, and (iii) the oxidation of organic matter
in the water column, which consumes oxygen
(2). Atmospheric PO2 changes through time have
been estimated via geochemical proxy data and
box models (3). Oceanic paleoredox proxies typically
track the areal extent of euxinic waters
(containing H2S) and the presence or absence
of anoxia (positive or zero [O2]) (4, 5). Because
most modern marine animals are sensitive to
[O2] changes between ~10 and ~100 μmol/kg (2),
development of long-term proxy reconstructions
for [O2] in this critical range (oxic-hypoxic)
would help elucidate when and how oceanic
oxygenation evolved to accommodate the modern
ecological landscape.
Carbonate I/Ca is one of the proxies developed
for the oxic-hypoxic window that has the
potential to reconstruct secular trends in upper-ocean
oxygenation (6, 7). The long residence time
of iodine (~300,000 years) leads to generally
uniform total iodine concentrations in the modern
ocean, but speciation changes of iodine between
iodate (IO3⁻) and iodide (I⁻) are controlled
locally (8, 9). IO3⁻ is completely reduced to I⁻ in
waters at low [O2] (8, 9) and reoxidized under
well-oxygenated conditions. Because IO3⁻ is the
only chemical form of iodine incorporated into
the carbonate structure (7) by replacing the CO3²⁻
ion (10), carbonate I/Ca records of local seawater
[IO3⁻] through time can indicate changes in [O2].
Carbonate I/Ca has been shown to be a reliable
tracer that responds primarily to [O2] variations
in marine environments over a wide range of
geological periods (6, 11-16).

We measured I/Ca in an extensive Phanerozoic 
collection of shallow marine carbonates likely 
forming within the top 200 m of the water col- 
umn and compiled these values with published 
data (table S1 and Fig. 1A). Maximum I/Ca values 
for individual localities were generally low in the 
Proterozoic, except for periods that have been 
associated with potential atmospheric PO2 rises
[e.g., the Great Oxidation Event (12)] and some
Neoproterozoic carbon isotope excursions (11, 15),
when maximum values temporarily increased
to Cenozoic levels (3 to 4 μmol/mol) (Fig. 1A).
Paleozoic maximum values are comparable to
those of the Proterozoic, despite a relatively short
spike during the Devonian, at about 400 million
years (Ma) ago, when the 75th percentile values
reached Cenozoic levels. Break-point analyses
indicate a step change at Triassic to early Jurassic
(~200 Ma ago, fig. S1), after which maximum
values remain above 4 μmol/mol and 75th percentile
values are mostly higher than 3 μmol/mol
(6, 13). The Devonian I/Ca excursion and the step 
change at ~200 Ma ago are two key observations 
in this data compilation. 
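
The two summary statistics used above, a 75th percentile and a break point, can be sketched as follows. This is a generic least-squares two-segment split on made-up numbers, not the authors' break-point procedure (which is described only in their fig. S1); the function names and the sample values are assumptions for illustration.

```python
import statistics

def p75(values):
    """75th percentile using the inclusive quantile method."""
    return statistics.quantiles(values, n=4, method="inclusive")[2]

def sse(segment):
    """Sum of squared deviations of a segment from its own mean."""
    mean = statistics.fmean(segment)
    return sum((v - mean) ** 2 for v in segment)

def best_break(ages_ma, values):
    """Age (Ma) of the oldest sample in the younger segment of the best split.

    Samples are ordered oldest first; the split minimizes the combined
    squared deviation of the two segments from their respective means.
    """
    pairs = sorted(zip(ages_ma, values), reverse=True)  # oldest first
    ys = [v for _, v in pairs]
    costs = [(sse(ys[:k]) + sse(ys[k:]), k) for k in range(1, len(ys))]
    _, k_best = min(costs)
    return pairs[k_best][0]

# Made-up maximum I/Ca values (umol/mol) per locality, for illustration only.
ages = [500, 450, 400, 350, 300, 250, 190, 150, 100, 50, 10]
imax = [0.8, 1.0, 3.5, 1.2, 0.9, 1.1, 4.5, 5.0, 4.8, 5.5, 6.0]

break_age = best_break(ages, imax)
young_p75 = p75([v for a, v in zip(ages, imax) if a <= break_age])
```

In this toy series a single high value at 400 Ma does not move the break point, which mirrors the distinction drawn in the text between a short-lived excursion and a sustained step change.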

The stark contrast between predominantly low 
Paleozoic values and high Meso-Cenozoic values 
(excluding the Triassic, i.e., <200 Ma ago) can- 
not be explained by sampling biases. The sample 
size for the Paleozoic (n = 894) is comparable to 
that for the Proterozoic (n = 1078) and the
Meso-Cenozoic (n = 926). The sampling density
(number of samples per unit time) is similar for the
Paleozoic and Meso-Cenozoic, although lower in 
the Proterozoic (Fig. 1B). For Paleozoic samples, 
we targeted carbonate- and fossil-rich (shallow) 
continental-shelf locations, i.e., relatively well- 
oxygenated settings, which are prone to recording 
high I/Ca values. By contrast, many Mesozoic 
data were generated from sections recording 
well-established global oceanic anoxic events, 
which, if anything, would bias that dataset toward 
low values. Cenozoic I/Ca values were measured 
in sediment coarse fractions, which may bet- 
ter preserve primary I/Ca signatures than bulk- 
rock samples. Existing early Cenozoic (14) and 
Cretaceous I/Ca data (13) are generally com- 
parable across different lithologies, although 
comparisons between the Cenozoic and other 
periods are more tenuous. The current dataset 
has relatively denser sample coverage for in- 
tervals coinciding with Earth system pertur- 
bations (e.g., major carbon-isotope excursions 
and mass extinctions) than for extended inter- 
vals with limited environmental changes, but 
this should not influence main features of the 
data compilation. 
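
The sampling-density comparison can be checked roughly from the sample counts quoted above. The sketch below is a back-of-the-envelope estimate: the era boundaries (2500, 541, and 252 Ma) are rounded assumptions rather than values from the text, the samples need not span each interval evenly, and Fig. 1B may bin the data differently.

```python
# Sample counts are from the text; era boundaries (Ma) are rounded assumptions.
ERAS = {
    "Proterozoic": {"n": 1078, "start_ma": 2500, "end_ma": 541},
    "Paleozoic": {"n": 894, "start_ma": 541, "end_ma": 252},
    "Meso-Cenozoic": {"n": 926, "start_ma": 252, "end_ma": 0},
}

def sampling_density(n, start_ma, end_ma):
    """Average number of samples per million years over the interval."""
    return n / (start_ma - end_ma)

density = {name: sampling_density(**era) for name, era in ERAS.items()}
for name, value in density.items():
    print(f"{name}: {value:.2f} samples per Myr")
```

With these assumed boundaries the Paleozoic and Meso-Cenozoic both come out at about 3 to 4 samples per million years, while the Proterozoic is several times sparser, in line with the description in the text.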

I/Ca values can potentially be reduced dur- 
ing subaerial exposure, marine burial, and dolo- 
mitization, but no postdepositional alterations 
are known to increase I/Ca (11). A variety of diagenesis
indicators were considered in previous
studies of samples that we used here (7, 15-17). 
In all of those case studies, the number of po- 
tentially altered samples was limited and did 
not influence the central trend of most of the 
data through time, as represented by the 25th 
and 75th percentile values (Fig. 1A). High I/Ca 
values throughout the record (Fig. 1A) are not 
consistently tied to a specific inferred primary 
carbonate mineralogy (e.g., calcite versus arag- 
onite seas, fig. S2A). The distinct behaviors of 


¹Department of Earth Sciences, Syracuse University, Syracuse, NY, USA. ²Department of Earth Sciences, University of California, Riverside, Riverside, CA, USA. ³School of Geographical Sciences, University of Bristol, Bristol, UK. ⁴Department of Geology and Geophysics, Yale University, New Haven, CT, USA. ⁵Department of Earth and Environmental Sciences, Wesleyan University, Middletown, CT, USA. ⁶Department of Earth and Environmental Sciences, Michigan State University, East Lansing, MI, USA. ⁷State Key Laboratory of Biogeology and Environmental Geology and School of Earth Science, China University of Geosciences, Wuhan, China. ⁸State Key Laboratory of Geological Processes and Mineral Resources, China University of Geosciences, Wuhan, China. ⁹Department of Geology, University of Cincinnati, Cincinnati, OH, USA. ¹⁰School of Earth Sciences, The Ohio State University, Columbus, OH, USA. ¹¹Department of Geosciences, Virginia Polytechnic and State University, Blacksburg, VA, USA. ¹²School of Earth and Space Sciences, University of Science and Technology of China, Hefei, China. ¹³State Key Laboratory for Mineral Deposits Research, School of Earth Sciences and Engineering, Nanjing University, Nanjing, China. ¹⁴Department of Geological and Environmental Sciences, Appalachian State University, Boone, NC, USA. ¹⁵Department of Geosciences, University of Alaska, Fairbanks, Fairbanks, AK, USA. ¹⁶Geology Department, State University of New York College at Cortland, Cortland, NY, USA. ¹⁷Department of Earth Sciences, University of Oxford, Oxford, UK. ¹⁸Earth System Science Group, College of Life and Environmental Sciences, University of Exeter, Exeter, UK. ¹⁹Department of Geosciences, Pennsylvania State University, University Park, PA, USA.

*Corresponding author. Email: zunlilu@syr.edu

Lu et al., Science 361, 174-177 (2018) 13 July 2018




I/Ca before and after ~200 Ma ago (Fig. 1A) 
cannot be explained by secular changes in seawater
[Ca²⁺] (fig. S2B). No evidence suggests
that differences in Paleozoic and Mesozoic I/Ca 
distributions were due to uniformly greater al- 
teration of the Paleozoic samples (fig. S3). Lower 
relative standard deviations of neighboring 
samples in each section (i.e., smoother I/Ca pro- 
files; fig. S4) may reflect better preservation 
of the Paleozoic than the Proterozoic samples 
(fig. S4). 

We interpret I/Ca in marine carbonates primarily
as a qualitative indicator for the depth
of the oxycline (Fig. 2), that is, that part of the
water column where the [O₂] decreases relatively
abruptly. Carbonate rocks formed in the
upper ocean record surface or near-surface seawater
[IO₃⁻], which is strongly affected by the
presence or absence of a proximal oxygen minimum
zone (OMZ) or a shallow oxycline. Owing
to the relatively slow oxidation kinetics of I⁻ (18),
surface waters may retain a low IO₃⁻ signal
despite high in situ [O₂]. For instance, core-top
(modern) planktonic foraminiferal shells
exhibit low I/Ca values (~0.5 µmol/mol) in waters
above a shallow OMZ in the equatorial Pacific
Fig. 1. Carbonate I/Ca through time. (A) Candlestick plot showing ranges of I/Ca values for
Fig. 2. Schematic illustrations for the evolution of oxygenation conditions. These simplified cartoons [...] all temporal and spatial variations. (A) Proterozoic. (B) Paleozoic. (C) Mesozoic and Cenozoic.
but record higher values (>3 µmol/mol) at other
well-oxygenated locations (6). 

The large I/Ca excursion during the Devonian
(Fig. 3) most likely reflects deepening of the
oxycline and development of better oxygenated
conditions in the upper ocean, consistent with
published proxy data and modeling results
(4, 5, 19, 20). Although different box models
yield somewhat divergent interpretations of
atmospheric pO₂ variation through the Phanerozoic
(21-24), a Devonian rise in pO₂ levels
is plausible, on the basis of the
carbon-oxygen-phosphorus-sulfur evolution (COPSE) model and
charcoal proxy reconstructions (Fig. 3A), and
was most likely due to increased abundance of
vascular land plants (19, 20). Previous work
interpreted δ⁹⁸Mo, iron-speciation, and biological
data (Fig. 3C) to reflect oceanic redox changes,
supporting the idea of atmospheric pO₂ rise
during the Devonian (4, 5). The combination
of these independent proxies indicates that
the Devonian atmospheric pO₂ rise affected the
whole atmosphere-ocean system, across the
entire redox spectrum (Fig. 3).

I/Ca values returned to Proterozoic-like levels
after the transient Devonian excursion, but
there is no evidence for a pO₂ decrease to
pre-Devonian levels between the Carboniferous and
the Triassic. Instead, the post-Devonian atmosphere
was probably relatively O₂ rich (Fig. 3A).
High atmospheric pO₂ likely altered terrestrial
weathering feedbacks and enhanced nutrient
delivery to the ocean (25, 26), leading to intensified
O₂ consumption in the upper ocean, a
generally shallow oxycline, and low I/Ca values
between ~400 and 200 Ma ago (Fig. 3B). Under
such conditions, carbonates formed in surface
oceans rapidly equilibrated with the high-pO₂
atmosphere would record low in situ [IO₃⁻]
because of the slow oxidation of I⁻ during mixing
between surface and subsurface waters (Fig. 2B).
If the oxycline were indeed shallow, marine animals
on continental shelves at that time (~400 to
200 Ma ago) would have been living in a thin
layer of well-oxygenated surface water directly
underlain by an OMZ (Fig. 2B). Our dataset
(Fig. 1A) implies that well-oxygenated upper-ocean
conditions became persistent and resilient
only by the Triassic-Jurassic (~200 Ma ago),
much later than previously inferred (27). The
prerequisites for achieving such well-oxygenated
upper-ocean conditions are a combination of high
atmospheric pO₂ and a generally deep oxycline
(Fig. 2C). The position of the oxycline is strongly
controlled by the depth of organic-matter
Fig. 3. Phanerozoic I/Ca compared with atmospheric pO₂, oceanic oxygenation, and animal evolution trends. (A) Modeled [...] (5, 21, 23, 24) in comparison with the charcoal proxy record (20). Dashed lines (5) mark a broadly defined ocean-atmospheric O₂ level, not just atmospheric pO₂. (B) I/Ca records through the Phanerozoic. Blue boxes [...] rock; gray boxes indicate bulk coarse fraction of Ocean Drilling Project samples (>63 µm). (C) Marine animal body-size record (36). The thickness of the green bars indicates relative generic [...] literature (31, 32). The red versus blue [...] versus icehouse climate conditions, respectively. Cm, [...] S, Silurian; D, Devonian; C, Carboniferous; P, Permian; T, Triassic; J, Jurassic; K, Cretaceous; Pg, Paleogene; N, Neogene.
Fig. 4. Residual sum of squares and subsurface [O₂] at different pO₂ levels and remineralization
remineralization, which is dependent on the efficiency
of organic-matter export from the photic
zone, and has been proposed as a governing pa- 
rameter for OMZs during the Phanerozoic (28). 

We hypothesize that changes in remineralization
of organic matter strongly influenced the
upper-ocean I/Ca signature (Fig. 2), and we
tested this hypothesis by simulating the marine
iodine cycle in the cGENIE Earth system model
(see methods and fig. S5) (29). We aimed to
identify possible causes for low I/Ca during the
Paleozoic through ensembles of model runs using
a range of values for atmospheric pO₂, the depth
of organic-matter remineralization in the water
column, and the mean concentrations of iodine
and phosphate in seawater. For each Paleozoic
model run, surface-water [IO₃⁻] values along
continental margins were extracted to calculate
a relative frequency distribution (fig. S6). The
modeled IO₃⁻ distributions were compared with
observed Paleozoic I/Ca distributions (Fig. 1C) to
obtain the residual sum of squares (RSS) (see
materials and methods, Fig. 4A, and fig. S7).
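
The data-model comparison described above can be illustrated with a minimal sketch: two sets of values are binned into relative frequency distributions on shared bin edges, and the RSS is the sum of squared differences between the two distributions. The sketch assumes the modeled and observed values have already been placed on a common scale. The function names, bin edges, and sample values are hypothetical, and the authors' actual binning and conversion steps are described in their materials and methods, not here.

```python
from bisect import bisect_right


def relative_frequency(values, bin_edges):
    """Fraction of `values` in each bin defined by ascending `bin_edges`.

    Bins are left-closed; the last bin also includes its right edge.
    Values outside the edges are ignored.
    """
    counts = [0] * (len(bin_edges) - 1)
    for v in values:
        if v < bin_edges[0] or v > bin_edges[-1]:
            continue
        idx = min(bisect_right(bin_edges, v) - 1, len(counts) - 1)
        counts[idx] += 1
    total = sum(counts)
    if total == 0:
        raise ValueError("no values fall inside the bin edges")
    return [c / total for c in counts]


def residual_sum_of_squares(observed_freq, modeled_freq):
    """Sum of squared differences between two binned distributions."""
    if len(observed_freq) != len(modeled_freq):
        raise ValueError("distributions must share the same bins")
    return sum((o - m) ** 2 for o, m in zip(observed_freq, modeled_freq))


# Hypothetical values on a shared scale (not data from the study).
bin_edges = [0.0, 1.0, 2.0, 3.0, 4.0]
observed = [0.2, 0.4, 0.8, 1.5, 2.5]
modeled = [0.3, 0.6, 1.2, 1.8, 3.5]

obs_freq = relative_frequency(observed, bin_edges)   # [0.6, 0.2, 0.2, 0.0]
mod_freq = relative_frequency(modeled, bin_edges)    # [0.4, 0.4, 0.0, 0.2]
rss = residual_sum_of_squares(obs_freq, mod_freq)    # approximately 0.16
```

A lower RSS means the modeled distribution sits closer to the observed one, which is how the best-fitting model configurations would be identified in an ensemble.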

We found that the lowest RSS values (<0.05),
representing the best data-model fits, were
achieved at shallow remineralization depths
[i.e., <0.5 present oceanic level (POL)]. In the
same set of cGENIE runs (Fig. 4B), lower RSS
values correlated with lower average [O₂] in
the subsurface layer (80 to 176 m), which is
consistent with a shallower oxycline. Even as
Paleozoic oceans experienced transitions between
greenhouse and icehouse climate conditions, pCO₂
levels appear to have had minimal influence on
IO₃⁻ distributions (fig. S8). Global-scale changes
in ocean circulation and continent configuration
also do not notably influence the oxycline depths
independently of pO₂ and subsurface oxygen
consumption (fig. S9). The RSS contours differed
only slightly when the Paleozoic I/Ca distribution
was compared with modeled [IO₃⁻] distributions
in the top four layers in the upper ocean
(from 0 to 410 m, fig. S10). Thus, a lack of precise
constraints on the paleodepths of carbonate
formation is unlikely to have affected the main
conclusions of our data-model comparison.
Additional model runs also suggest that oceanic
nutrient concentrations and total iodine
concentrations are unlikely to dominate the secular
trends in proxy data (figs. S11 and S12). Our
data-model comparison (Fig. 4A) should not
be viewed as a precise estimate of the atmospheric
pO₂ for any single time slice, because the
data were compiled over the entire Paleozoic
under varying pO₂ levels. Thus, the lower RSS
values at pO₂ below 1 present atmospheric level
(PAL) suggest that some portions of the Paleozoic
may have had pO₂ levels lower than those of
today (5, 23).

On the basis of our data compilation and model 
analyses, we attribute the transition at ~200 Ma 
ago from Proterozoic-like low I/Ca values in 
the Paleozoic (except for the mid-Devonian) to 
modern-like high values in the Meso-Cenozoic 
to a profound increase in the average remineral- 
ization depth of organic matter in the water col- 
umn. The timing of this transition is consistent 
with the proliferation of eukaryotic phyto- and
zooplankton after the Permian-Triassic extinc- 
tion (Fig. 3A) (30, 31), which eventually shaped 
the ecological landscape of the modern ocean 
(32). The larger size of primary producers (33), 
grazing and repackaging of organic matter into 
fecal pellets (34), and/or the advent of mineral- 
ized plankton (32) may have led to faster sinking 
of organic matter, which reduced O₂ utilization
in the upper water column and caused a perva- 
sive deepening of the oxycline (28). 

The rise of oxygen levels over geological time 
has been linked to increases in animal body size 
(24, 35). A comprehensive compilation of Phan- 
erozoic marine animal body-size data (36) shows 
that maximum biovolume probably covaried with 
I/Ca to some extent (Fig. 3B), indicating that O₂
availability in the global upper ocean may have 
been an important factor in Phanerozoic meta- 
zoan evolution. New forms of organisms (e.g., 
mineralized plankton, larger animals) fundamen- 
tally influenced oceanic environments, which in 
turn affected the evolving biosphere, represent- 
ing a prime example of the coevolution of life 
and planet. 


REFERENCES AND NOTES 


1. A. H. Knoll, Cold Spring Harbor Perspect. Biol. 6, a016121 (2014).
2. R. E. Keeling, A. Körtzinger, N. Gruber, Ann. Rev. Mar. Sci. 2, 199-229 (2010).
3. T. W. Lyons, C. T. Reinhard, N. J. Planavsky, Nature 506, 307-315 (2014).
4. T. W. Dahl et al., Proc. Natl. Acad. Sci. U.S.A. 107, 17911-17915 (2010).
5. E. A. Sperling et al., Nature 523, 451-454 (2015).
6. Z. Lu et al., Nat. Commun. 7, 11146 (2016).
7. Z. Lu, H. C. Jenkyns, R. E. M. Rickaby, Geology 38, 1107-1110 (2010).
8. G. W. Luther III, T. Campbell, Deep-Sea Res. A, Oceanogr. Res. Pap. 38, S875-S882 (1991).
9. E. L. Rue, G. J. Smith, G. A. Cutter, K. W. Bruland, Deep Sea Res. Part I Oceanogr. Res. Pap. 44, 113-134 (1997).
10. J. Podder et al., Geochim. Cosmochim. Acta 198, 218-228 (2017).
11. D. S. Hardisty et al., Earth Planet. Sci. Lett. 463, 159-170 (2017).
12. D. S. Hardisty et al., Geology 42, 619-622 (2014).
13. X. Zhou et al., Paleoceanography 30, 510-526 (2015).
14. X. L. Zhou, E. Thomas, R. E. M. Rickaby, A. M. E. Winguth, Z. L. Lu, Paleoceanography 29, 964-975 (2014).
15. W. Lu et al., Geochem. Perspect. Lett. 5, 53-57 (2017).
16. C. T. Edwards, D. A. Fike, M. R. Saltzman, W. Lu, Z. Lu, Earth Planet. Sci. Lett. 481, 125-135 (2018).
17. G. R. Loope, L. R. Kump, M. A. Arthur, Chem. Geol. 351, 195-208 (2013).
18. R. Chance, A. R. Baker, L. Carpenter, T. D. Jickells, Environ. Sci. Process. Impacts 16, 1841-1859 (2014).
19. T. M. Lenton et al., Proc. Natl. Acad. Sci. U.S.A. 113, 9704-9709 (2016).
20. I. J. Glasspool, A. C. Scott, Nat. Geosci. 3, 627-630 (2010).
21. R. A. Berner, Geochim. Cosmochim. Acta 70, 5653-5664 (2006).
22. N. M. Bergman, T. M. Lenton, A. J. Watson, Am. J. Sci. 304, 397-437 (2004).
23. T. M. Lenton, S. J. Daines, B. J. Mills, Earth Sci. Rev. 178, 1-28 (2018).
24. P. G. Falkowski et al., Science 309, 2202-2204 (2005).
25. L. R. Kump, Nature 335, 152-154 (1988).
26. T. J. Algeo, R. A. Berner, J. B. Maynard, S. E. Scheckler, GSA Today 5, 64-66 (1995).
27. H. D. Holland, Philos. Trans. R. Soc. London B Biol. Sci. 361, 903-915 (2006).
28. K. M. Meyer, A. Ridgwell, J. L. Payne, Geobiology 14, 207-219 (2016).
29. A. Ridgwell et al., Biogeosciences 4, 87-104 (2007).
30. M. E. Katz, Z. V. Finkel, D. Grzebyk, A. H. Knoll, P. G. Falkowski, Annu. Rev. Ecol. Evol. Syst. 35, 523-556 (2004).
31. R. E. Martin, Global Planet. Change 11, 1-23 (1995).
32. P. G. Falkowski et al., Science 305, 354-360 (2004).
33. N. J. Butterfield, Geobiology 7, 1-7 (2009).
34. G. A. Logan, J. M. Hayes, G. B. Hieshima, R. E. Summons, Nature 376, 53-56 (1995).
35. J. L. Payne et al., Photosynth. Res. 107, 37-57 (2011).
36. N. A. Heim, M. L. Knope, E. K. Schaal, S. C. Wang, J. L. Payne, Science 347, 867-870 (2015).
37. W. Lu et al., (Table S2) I/Ca record of Phanerozoic carbonates. PANGAEA (2018).


ACKNOWLEDGMENTS 


Funding: We acknowledge funding from grants NSF EAR-1349252, OCE-1232620, and OCE-1736542 (Z.L.); NERC NE/J01043X/1, ERC-2013-CoG-617313, and NSF OCE-1736771 (A.R.); NSF OCE-1736538 (E.T.); National Key R&D Project of China 2016YFA0601104 and NSFC 41290260 (G.L.); NSFC 41330102 and 41721002 and the “111” project (Y.S.); Wolfson Research Merit award from the Royal Society and EU award (R.E.M.R.); and NERC NE/N018508/1 and NE/P013643/1 (T.M.L.). Author contributions: Z.L. conceived the study. W.L. and Z.L. carried out the data analysis and model simulations and wrote the paper. A.R. modified cGENIE to incorporate the iodine cycle. All authors contributed to data interpretation and the writing of the manuscript. Competing interests: The authors declare that they have no competing interests. Data and materials availability: All I/Ca data are available in the supplementary materials. Table S2 is additionally provided online at www.pangaea.de/ (37).


SUPPLEMENTARY MATERIALS 


www.sciencemag.org/content/361/6398/174/suppl/DC1 
Materials and Methods 

Figs. S1 to S12 

References (38-52) 

Tables S1 and S2 


20 November 2017; accepted 17 May 2018 
Published online 31 May 2018 
10.1126/science.aar5372 




RESEARCH 


EVOLUTIONARY COGNITION 


Sensitivity to “sunk costs” 
in mice, rats, and humans 


Brian M. Sweis, Samantha V. Abram, Brandy J. Schmidt, Kelsey D. Seeland,
Angus W. MacDonald III, Mark J. Thomas*, A. David Redish*


Sunk costs are irrecoverable investments that should not influence decisions, because 
decisions should be made on the basis of expected future consequences. Both human 
and nonhuman animals can show sensitivity to sunk costs, but reports from across 
species are inconsistent. In a temporal context, a sensitivity to sunk costs arises when 
an individual resists ending an activity, even if it seems unproductive, because of the 
time already invested. In two parallel foraging tasks that we designed, we found that 
mice, rats, and humans show similar sensitivities to sunk costs in their decision-making. 
Unexpectedly, sensitivity to time invested accrued only after an initial decision had been 
made. These findings suggest that sensitivity to temporal sunk costs lies in a vulnerability 
distinct from deliberation processes and that this distinction is present across species. 


Traditional economic theory suggests that
decisions should be based on valuations of
future expectations that ignore spent resources
that cannot be recovered [sunk costs
(1)]. However, extensive evidence shows that
humans factor such sunk costs into prospective 
decisions, even when faced with better alterna- 
tives (2, 3). Although early reports claimed that 
humans are uniquely sensitive to sunk costs 
(2, 3), it is becoming increasingly clear that non- 
human animals exhibit parallel behaviors (4). 
Previous nonhuman animal studies that at- 
tempted to model the sunk cost phenomenon 
have yielded conflicting evidence (4, 5). Observa- 
tional and experimental field studies in swal- 
lows, sparrows, mice, and bluegills have found 
evidence both for and against the sunk cost ef- 
fect in behaviors relating to parental investment 
and willingness to care for young (6-10). Yet in 
such studies, it has been difficult to disentangle 
influences of investment history from those of 
future prospects. Laboratory operant conditioning 
paradigms in pigeons and rats that control for 
future expectations when looking at reinforce- 
ment learning behaviors have demonstrated that 
nonhuman animals show increased work ethic 
or suboptimal perseverative reward-seeking behav- 
iors that escalate with prior investment amount 
(11, 12). However, these observations often relied 
on situations of information uncertainty, where 
subjects overworked in the absence of progress- 
indicating cues. These observations also often 
relied on automation or habit-like behaviors 
(e.g., repetitive lever pressing) driving contin- 
ued reward pursuit. Such confounding factors 
in nonhuman animal studies obscure translation 
to human sunk cost effects, which do not depend
on these mechanisms. 

Laboratory foraging tasks provide an alterna- 
tive approach to study decision-making by using 
naturalistic behaviors that carry both ecological 
validity and evolutionary significance and are 
translatable across species (13). Foraging tasks 
rely on optimizing reward-seeking under condi- 
tions of limited resources, making them econom- 
ic tasks. 

We designed a foraging task in which subjects 
spent time from a limited time budget waiting 
for rewards [Fig. 1, Restaurant Row (14) and web-Surf
(15)]. The division of time spent during the
task reveals the economic preferences of the sub- 
jects. All three species learned to forage in a way 
that revealed preferences for certain rewards, 
and all species used reliable subjective valuation 
strategies to decide between multiple competing 
reward offers (supplementary text S1). Our neuro- 
economic tasks directly test sensitivity to sunk 
costs across species. 

Flavors and genres of rewards (Fig. 1) allowed 
us to measure subjective preferences as a func- 
tion of cost, avoiding the confounding possibility 
that different reward sizes might require differ- 
ent consumption times. Multiple zones allowed 
us to characterize multiple valuation processes 
involved in decisions: initial commitment valu- 
ations (offer zone), secondary reevaluations (wait 
zone), and postconsumption hedonic valuations 
(supplementary text S1 and S2). In this task, two
key factors minimized information uncertainty 
and automated reward-seeking behavior as po- 
tential confounding factors: (i) Subjects were pro- 
vided full information on cost and investment 
progress (tones counting down or download bar 
shrinking), and (ii) earning rewards required sub- 
jects to wait and withhold quitting after making 
an initial acceptance decision (rather than re- 
quiring a repetitive action). 

To address susceptibility to sunk costs, we 
examined quit decisions in the wait zone. These 
behaviors involve the abandonment of continued 


reward pursuit despite having made prior invest- 
ments (partial waiting) while on a limited budget 
(time). We parameterized the probability of earn- 
ing a reward in the wait zone as a function of the 
remaining time investment required to earn a re- 
ward (future costs) and the prior time investment 
already spent waiting in the wait zone [past 
(sunk) costs; fig. S3]. The data yielded many 
samples across all conditions of time remain- 
ing and time spent (fig. S4), which allowed us to 
measure the extent to which irrecoverable prior 
investments (sunk costs) escalated wait zone com- 
mitment (fig. S5). 
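
The parameterization described above can be sketched in a few lines: each wait-zone trial contributes one observation for every second the subject was still waiting, keyed by time already spent and time remaining, and the probability of earning is the fraction of those observations that came from trials ending in a reward. This is a minimal illustration only. The trial encoding, the function name `earn_probability`, the one-second binning, and the sample trials are all assumptions, and the sketch does not reproduce the adjusted controls described in the supplementary figures.

```python
from collections import defaultdict
from math import floor


def earn_probability(trials):
    """Estimate P(earn) for each (time_spent, time_remaining) pair.

    `trials` is an iterable of (delay_s, quit_time_s) tuples, where
    `quit_time_s` is None if the subject waited out the full delay
    and earned the reward (hypothetical encoding).
    """
    tallies = defaultdict(lambda: [0, 0])  # key -> [earned, total]
    for delay_s, quit_time_s in trials:
        earned = quit_time_s is None
        # Number of whole-second marks at which the subject was present.
        seconds_present = delay_s if earned else floor(quit_time_s) + 1
        for spent in range(seconds_present):
            key = (spent, delay_s - spent)  # (sunk cost, future cost)
            tallies[key][0] += int(earned)
            tallies[key][1] += 1
    return {key: e / n for key, (e, n) in tallies.items()}


# Hypothetical trials: (offer delay in s, quit time in s or None if earned).
trials = [(10, None), (10, 1.5), (7, None), (7, 0.5), (7, 2.0)]
p_earn = earn_probability(trials)

# Same future cost (7 s remaining), different sunk cost (0 s vs 3 s waited).
just_entered = p_earn[(0, 7)]
after_waiting = p_earn[(3, 7)]
```

Comparing entries with the same time remaining but different time spent is the comparison of interest: a sunk cost effect would appear as a higher earn probability after more time has already been invested.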

We found that mice, rats, and humans demon- 
strated robust sunk cost effects [analysis of vari- 
ance (ANOVA) collapsing across all sunk cost 
conditions: mice, F = 30.75, P < 0.0001; rats, F = 
45.65, P < 0.0001; humans, F = 3.95, P < 0.0001] 
(Fig. 2). Importantly, increasing prior invest- 
ment amounts generated a continuously stronger 
sunk cost effect (example post-hoc comparison 
between +1-s and +5-s sunk costs: mice, F = 45.49, 
P < 0.0001; rats, F = 54.41, P < 0.0001; humans,
F = 4.21, P < 0.05) (Fig. 2), a critical tenet of the
sunk cost fallacy (2, 4). 

Time spent in the offer zone also detracts 
from the total time budget, and a similar analysis 
can be performed (fig. S6). In contrast to our 
findings for reevaluation processes in the wait 
zone, we found no effect of time spent in the 
offer zone. That is, the amount of time spent in 
the offer zone did not alter the probability of 
earning rewards once in the wait zone (ANOVA 
collapsing across all offer conditions: mice, F = 
1.55, P = 0.23; rats, F = 0.77, P = 0.39; humans: 
F = 0.12, P = 0.74) (Fig. 3). Importantly, the de- 
lay to reward did not start counting down while 
the subject remained in the offer zone. This 
meant that the animal was choosing between 
distant options and had not yet invested in the 
offer. This lack of an effect of time spent in the 
offer zone on progress abandonment once com- 
mitted suggests that waste avoidance, overall 
resource depletion, and loss aversion are insuf- 
ficient explanations of sunk cost-driven esca- 
lation of reward-seeking behavior (figs. S7 to 
S11, supplementary text S2 and S3, and table S1).
This also suggests that the offer zone and wait 
zone may access separable valuation processes 
and reveals a previously unknown determinant of 
susceptibility to sunk costs rooted in dissociable 
decision-making algorithms that are conserved 
across species. 

A sensitivity to sunk costs defies optimality 
considerations (fig. S12). So, why has this cog- 
nitive bias persisted across evolution (supple- 
mentary text S4)? Three plausible psychological 
mechanisms that support sunk cost biases in- 
clude (i) that it may be more advantageous to 
calculate reward value through effort expended, 
(ii) state-dependent valuation learning (SDVL), 
and (iii) within-trial contrast (WTC) processes 
(12, 16-20). We discuss each of these below. 

Because predicting valuations that depend on 
future outcomes is complex and difficult, ani- 
mals may have evolved processes in which val- 
uation is measured from effort spent rather than 
calculated explicitly as an estimate from constructed
imaginations of potential future
comes. Past effort is easy to measure but has a 
limited (though nonzero) correlation with future 
value. In contrast, calculating value from ex- 
pected future outcomes has its own estimation 
uncertainties. If the correlation between past ef- 
forts and future value provides better predictions 
than the uncertainties of future outcomes, then 
animals may have evolved processes that use past 
effort as a proxy to estimate future value (16, 19). 
This can explain our observation that the postcon- 
sumption evaluation increases proportionally to 
the time spent waiting for the reward in all three 
species (fig. S13 and supplementary text S5). 
The fact that susceptibility to sunk costs only ac- 
crued in the wait zone implies that valuations in 
the offer zone depend on different processes 
that do not include measures of effort spent, but 
that may be more related to direct estimates of
future value. 

The SDVL theory hypothesizes that energy 
spent working toward reward receipt moves the 
individual into a poorer energy state, enhancing 
the perceived value of the yet-to-be-obtained re- 
ward (19, 20). This continued work can thus es- 
calate commitment of continued reward pursuit 
with growing sunk costs. Similarly, the WTC 
theory describes the sunk cost phenomenon as 
an increasing contrast between the decision-maker’s
current physical state and the goal (21).
SDVL and WTC propose that either physiological 
or psychological states could drive added value, 
leading to a susceptibility to sunk costs. How- 
ever, we did not observe sunk costs accruing 
during the offer zone, even though time spent in 
the offer zone is equivalent in physical and cog- 
nitive demands to time spent in the wait zone. 


Simple explanations from the WTC and SDVL 
theories would predict sunk costs to accrue in 
the offer zone as well. 

Past-effort heuristics, SDVL, and WTC can in- 
deed be prominent drivers of the sunk cost effect 
in our data when sunk cost effects are present. 
Therefore, our work brings up an intriguing ques- 
tion: How do decision-making processes differ 
between the wait zone (susceptible to sunk costs) 
and the offer zone (not susceptible to sunk costs)? 

One possibility is that decisions made in the 
offer zone and wait zone may rely on separate 
processes that calculate value in distinct ways 
through dissociable neural circuits (22-24). Re- 
cent findings from other foraging tasks suggest 
that choosing to remain committed to already 
accepted options accesses different valuation al- 
gorithms than deliberating between distant op- 
tions (16, 25-27). We suggest that wait zone 


Fig. 1. Task schematics. (A) In the Restaurant Row task, food-restricted rodents were trained on a maze in which they encountered serial offers for flavored rewards in four “restaurants.” Each restaurant contained a separate offer zone and wait zone. Tones sounded in the offer zone; a fixed tone pitch indicated the delay for which rodents would have to wait in the wait zone (1 to 30 s, random on offer entry). Tone pitch descended in the wait zone during the delay “countdown.” Rodents could quit the wait zone for the next restaurant during the countdown, terminating the trial. (B) In the web-Surf task, humans performed an analogous 30-min computer-based foraging paradigm in which they encountered serial offers for short entertaining videos from four “galleries.” A static “download bar” appeared in the offer phase indicating delay length (1 to 30 s, random on offer entry), which did not begin downloading until after entering the wait phase. Downloads could be quit during the wait phase. Humans were also asked to rate each video on a scale from 1 (least enjoyable) to 4 (most enjoyable) after viewing and to rank the
Fig. 2. The amount of time spent waiting increases commitment to continuing reward pursuit in mice, rats, and humans. (A to C) Probability of earning a reward in the wait zone as a function of countdown time remaining in (A) mice, (B) rats, and (C) humans. Black data points indicate trials in which subjects had just entered the wait zone. Colored data points indicate time remaining in the countdown after subjects had already waited varying times (fig. S3). Linear regressions are plotted with 95% confidence interval shadings. (D to F) Slopes calculated from each linear regression in (A) to (C) (“observed”) and slopes recalculated iteratively from black data points to match colored data ranges in (A) to (C) (“adjusted controls”) (fig. S5) are plotted ±1 SEM. The colored tick on the x axis indicates time in the wait zone until the first significant sunk cost effect was observed. ANOVAs were used to compare slopes of linear regression models, testing for interactions with sunk cost conditions and controls, correcting for multiple comparisons. Not significant (n.s.), P > 0.05; *P < 0.05; **P < 0.01; ***P < 0.001; ****P < 0.0001.


Fig. 3. Resources spent while deliberating do not contribute to the sunk cost effect. (A to C) Amount of time spent in the offer zone choosing to skip versus to enter did not influence the probability of earning versus quitting once in the wait zone after subjects chose to enter (fig. S6). Linear regressions are plotted with 95% confidence interval shadings. (D) Slopes calculated from linear regressions are plotted ±1 SEM and are not significantly different from each other or zero in mice (F = 1.545, P = 0.229), rats (F = 0.767, P = 0.392), or humans (F = 0.117, P = 0.737), on the basis of an ANOVA with post-hoc comparisons against zero. n.s., P > 0.05.
decisions are driven by distinct mechanisms that
depend on recently accumulated states, whereas 
offer zone decisions are driven by deliberation 
mechanisms that simulate future outcomes con- 
structed from a more extended knowledge base 
of past experiences. There is strong neural evi- 
dence across these species to suggest that com- 
peting simulated future alternatives are being 
represented, evaluated, and compared in de- 
liberation algorithms, whereas other decision- 
making systems depend on more immediate 
sensory signals that likely include interoceptive
signals of effort expended and representations
of internal state (23, 24, 28-31) (fig. S8 and
supplementary text S3). Each of these decision- 
making systems provides computational advan- 
tages better suited for different situations. Thus, 
these multiple valuation algorithms can each 
confer independent evolutionary advantages and 
can coexist and persist across time and species 
(22-24). 

The sunk cost fallacy, by definition, arises from 
valuing spent resources that cannot be recovered. 
Our data show that these sunk costs accrue only
under specific situations in mice, rats, and hu- 
mans. We suggest that multiple, parallel decision- 
making valuation algorithms implemented in 
dissociable neural circuits have persisted across 
species and over time through evolution. Our 
data imply that these different valuation algo- 
rithms are differentially susceptible to sunk costs. 
Past studies that reported conflicting findings 
across species may have failed to consider how 
different decision systems drive behavior (fig. S14
and supplementary text S6 and S7). Studies iden- 
tifying differences in sensitivity to sunk costs 
should consider which decision-making pro- 
cesses are being accessed by the individual in a 
given task. Because these processes could change 
between species, or within species but across 
aging, stages of development, or circumstances, 
so too could sensitivity to sunk costs. 
Using a translational approach in mice, rats,
and humans, we find direct evidence in parallel 
tasks that the sunk cost phenomenon is con- 
served across species. Our findings highlight the 
utility of economic paradigms that can dissociate 
decision-making computations, using natural- 
istic tasks that are translatable across various 
species and that can be expanded to survey in- 
dividuals of varying ages or psychiatric popula- 
tions. These tasks and findings may aid future 
research in education or neuropsychiatry by 
shedding light on diagnostic or intervention strat- 
egies and revealing the roles of neurally distinct 
decision systems. 
Ethylene-gibberellin signaling
underlies adaptation of rice 
to periodic flooding 


Takeshi Kuroha, Keisuke Nagai, Rico Gamuyao, Diane R. Wang,
Tomoyuki Furuta, Masanari Nakamori, Takuya Kitaoka, Keita Adachi,
Anzu Minami, Yoshinao Mori, Kiyoshi Mashiguchi, Yoshiya Seto,
Shinjiro Yamaguchi, Mikiko Kojima, Hitoshi Sakakibara, Jianzhong Wu,
Kaworu Ebana, Nobutaka Mitsuda, Masaru Ohme-Takagi, Shuichi Yanagisawa,
Masanori Yamasaki, Ryusuke Yokoyama, Kazuhiko Nishitani, Toshihiro Mochizuki,
Gen Tamiya, Susan R. McCouch, Motoyuki Ashikari


Most plants do poorly when flooded. Certain rice varieties, known as deepwater rice, survive 
periodic flooding and consequent oxygen deficiency by activating internode growth of stems to 
keep above the water. Here, we identify the gibberellin biosynthesis gene, SD1 (SEMIDWARF1), 
whose loss-of-function allele catapulted the rice Green Revolution, as being responsible for 
submergence-induced internode elongation. When submerged, plants carrying the deepwater 
rice-specific SD1 haplotype amplify a signaling relay in which the SD1 gene is transcriptionally
activated by an ethylene-responsive transcription factor, OsEIL1a. The SD1 protein directs
increased synthesis of gibberellins, largely GA4, which promote internode elongation.


Evolutionary analysis shows that the deepwater rice-specific haplotype was derived from
standing variation in wild rice and selected for deepwater rice cultivation in Bangladesh. 


Deepwater rice varieties, grown mainly in
Asian lowland areas, respond to months- 
long deep flooding by increasing plant 
height as water levels rise. The deepwater 
growth response is due to rapid internode 
elongation, allowing leaves to remain above the 
water surface; internode elongation is accom-
plished by both cell proliferation and elonga-
tion (1). Only SNORKEL1 and SNORKEL2 (SK1/2),
encoding transcription factors with high similar- 
ity to the flash flood-tolerant regulator SUB1A (2), 
have previously been identified as causal genes for 
the deepwater response (3). Although ethylene and 
gibberellins are thought to trigger this response 
(1, 4), the molecular mechanism(s) underlying
the interaction among these hormones during 
internode elongation remains elusive. 

To identify factors regulating the deepwater 
response in rice, we carried out a genome-wide 
association study (GWAS) with a diversity panel 
of Asian rice and deepwater rice varieties (table 
S1). We measured nine deepwater traits (Fig. 1A, 
fig. S1, and table S2) and selected total internode 
length measured 7 days after submergence at 
the 10-leaf stage (TILS) as a representative trait 
(Fig. 1B and figs. S2 and S3). The GWAS with the 
TILS value as a proxy for the deepwater response 
(figs. S4 to S6) revealed six quantitative trait loci
(QTLs) exceeding the significance threshold (Fig. 1C
and table S3). The GWAS peak on chromosome 1
was located within a QTL, qTIL1^C9285, which we
had detected in a previous analysis of the deep-
water response (5) (fig. S7). To identify the gene(s)
responsible for qTIL1^C9285, we conducted high-
resolution linkage analysis by using a mapping
population from a cross between near-isogenic
lines NIL-12 and NIL-1+12, both containing chro-
mosomal introgressions from deepwater rice
(C9285 variety) in a nondeepwater rice (T65 variety)
background (figs. S8 to S10). The candidate region
was delimited to a 5.5-kb stretch that included the
OsGA20ox2 (Oryza sativa gibberellin 20-oxidase 2)
gene. Also known as SD1 (SEMIDWARF1), this
gene encodes a gibberellin biosynthesis enzyme;
its null allele produces a semidwarf phenotype and
was selected during the rice Green Revolution (6)
(Fig. 1D and fig. S11). Introgression of the C9285
SD1 allele (SD1^C9285) into NIL-12 increased total
internode length in response to submergence
(Fig. 1E), and near-constitutive expression of
SD1^C9285 increased internode elongation even
without submergence (fig. S12). Furthermore,
NIL-12 with a null allele of SD1 showed shorter
total internode length in response to submer-
gence than either NIL-12 or NIL-1+12 (fig. S13).
These results suggest that SD1^C9285 is respon-
sible for qTIL1^C9285, which contributes to the pro-
motion of internode elongation in response to
submergence.

Next, we compared polymorphisms in the GWAS
panel across the qTIL1^C9285 candidate region and
identified six SD1 haplogroups (fig. S14 and table
S4). The haplogroup belonging to C9285 (Hap-6)
had the highest TILS value in the presence of
SK1/2 (Fig. 2A and fig. S15). Hereafter, we refer
to the SD1^C9285 haplotype, including the 17 specific
polymorphisms in the promoter and second intron,
as the deepwater rice-specific haplotype (DWH)
(fig. S14). In response to submergence, SD1 tran-
script accumulation in C9285 was higher than
that in T65 (Fig. 2B) and diminished from 6 hours
until 12 hours after submergence, consistent with
negative feedback of gibberellin suppressing SD1
transcription (7) (fig. S16). Accumulation of SD1
transcripts was observed in the elongating inter-
node (Fig. 2C). NIL-1 also showed induced SD1
transcript accumulation (figs. S8A and S17); we
found a significant association between the level
of SD1 transcript accumulation and internode
elongation in recombinant lines from the linkage
analysis (fig. S18). Furthermore, varieties harbor-
ing the DWH and Hap-5 showed higher SD1 tran-
script accumulation than varieties containing
other SD1 haplotypes, regardless of the presence
of SK1/2 (fig. S19). Our results suggest that the
DWH potentiates SD1 transcript accumulation
in response to submergence, independently of
SK1/2, and that this induction works in synergy
with SK1/2 to enhance internode elongation in
response to submergence.

We next tested SD1 transcription in response
to ethylene (1, 4). Ethylene application induced
higher SD1 transcript accumulation in C9285 and
NIL-1 than in T65 (Fig. 2D and fig. S20) even in
the presence of a protein synthesis inhibitor
(fig. S21), suggesting that ethylene-inducible SD1
transcription is mediated by the DWH and that
SD1 is regulated by the ethylene signaling pathway
without de novo protein synthesis. The ethylene
response involves stabilization and accumulation
of the transcription factor EIN3 (ETHYLENE
INSENSITIVE 3) family to activate the transcrip-
tion of ethylene-responsive genes (8), so we inves-
tigated the transcriptional regulation of SD1 by an
EIN3 homolog, OsEIL1a protein (fig. S22). Near-
constitutive expression of OsEIL1a fused with a
repression domain (9) resulted in suppression of
SD1 induction in response to ethylene treatment
(Fig. 2E and fig. S23). A transactivation assay
demonstrated the direct activation of the SD1^C9285
promoter by OsEIL1a, whereas SK1/2 was not able
to transactivate the SD1^C9285 promoter (fig. S24).
Specific binding of OsEIL1a to the SD1^C9285 promoter
is supported by assays using a chemical induction
Fig. 1. Isolation of SD1 as the gene responsible for the deepwater response. (A) Deepwater response
submergence for 7 days. (B) Frequency distribution of the TILS in the GWAS. (C) Manhattan plot for the
TILS phenotype. The arrowhead indicates the position of the SD1 gene; the dotted line is the Bonferroni-
corrected 5% significance threshold. (D) The SD1 candidate region identified by high-resolution linkage
analysis. Chr., chromosome. (E) Gain-of-function analysis for the SD1 gene. Region A of C9285 in (D) was
transformed into NIL-12 (mean ± SD, n = 3 replicates). *P < 0.05 (Student's t test). Arrowheads
indicate positions of nodes. The boxed area is shown to the right at greater magnification.
Fig. 2. Contribution of SD1 to internode
elongation in response to submergence via
ethylene signaling. (A) Box plots of TILS
values for six SD1 haplotypes and absence (-)
or presence (+) of SK1/2. Box edges represent
the 0.25 quantile and 0.75 quantile, with median
values shown by bold lines; whiskers indicate
1.5 times the interquartile range. Different letters
denote significant differences according to the
Tukey-Kramer test (P < 0.05). (B) Quantitative
PCR (qPCR) analysis of SD1 transcription under sub-
mergence treatment. (C) qPCR analysis of submergence-
induced SD1 transcription in different tissue regions
(I to III) above the top node (arrow). (D) qPCR analysis
of SD1 transcription under ethylene treatment.
(E) qPCR analysis determining the effect of OsEIL1a-
SRDX near-constitutive expression on the induction of
SD1 transcription by ethylene. Values in (B) to (E)
are the mean ± SD (n = 3). *P < 0.05 (Student's
t test) versus 0 hours. (F) Transactivation of
the SD1 promoter by OsEIL1a in rice protoplasts
(mean ± SD, n = 3). *P < 0.05 (Student's t test)
versus the MYC control. NBS, 35S minimal promoter;
LUC, luciferase.
system, the OsEIL1a effector fused with an acti-
vation domain, and a mutant form of OsEIL1a
(figs. S25 and S26). Promoter deletion, scanning
mutagenesis, and synthetic promoter analyses
revealed that a 13-base pair (bp) sequence of the
SD1^C9285 promoter region (positions -123 to -111
from the start codon) is required for OsEIL1a trans-
activation (Fig. 2F and fig. S27, A and B). We con-
firmed direct binding of the OsEIL1a protein to the
SD1^C9285 promoter region by an in vitro protein-
DNA interaction assay (fig. S28). SD1^T65 and
SD1^C9285 promoters both harbor the 13-bp sequence
(table S4) and were each activated by OsEIL1a (fig.
S27C), suggesting that the regulatory element(s)
for ethylene-induced transcription is conserved in
SD1^T65 and SD1^C9285 promoters. As SD1 transcrip-
tion in T65 was not induced by ethylene treat-
ment (Fig. 2D and fig. S20), these observations
imply the existence of an unknown suppressor(s)
for the ethylene-inducible transactivation. It has
been reported that GA20-oxidase transcription is
under negative feedback regulation by a tran-
scription factor(s) or transcriptional co-regulators
(10). In deepwater rice harboring the DWH, this
negative feedback may be partially diminished,
leading to higher SD1 transcript accumulation
relative to that in other rice in response to
submergence.

SD1 encodes a GA20-oxidase for gibberellin
biosynthesis (11) (Fig. 3A). GA20 and GA9, the products
of GA20-oxidases in the early-13-hydroxylation
pathway (GA1 pathway) and non-13-hydroxylation
pathway (GA4 pathway), are converted into bio-
active gibberellin species GA1 and GA4, respectively
(11). In rice vegetative tissues, endogenous GA1 is
predominant whereas GA4 is minor (12). In con-
trast, in response to submergence, levels of both
GA1 and GA4 increased in C9285 but not in T65
(3). To test whether this increase is due to the
SD1^C9285 harbored by the DWH, internode gib-
berellin levels were quantified in T65, C9285, and
NIL-1 under submergence treatment (Fig. 3A).
Levels of GA20, GA9, GA1, and GA4 increased in
a time-dependent fashion in both C9285 and
NIL-1, whereas these trends were not observed
in T65, suggesting that the DWH contributes to
the increase in bioactive gibberellin species GA1
and GA4 after submergence.

The SD1 coding region contains two nonsyn-
onymous single-nucleotide polymorphisms (SNPs)
in the first and third exons among rice varieties 
(13) (Fig. 3B and fig. S14). We compared enzymatic 


Fig. 3. Contribution of SD1-derived GA4
for internode elongation. (A) Gibberellin
levels in elongating internodes under
submergence treatment (mean ± SD,
n = 3). ox, -oxidase; ND, not detected.
*P < 0.05 (Student's t test) versus
0 hours. (B) Exonic SNPs of SD1 between
T65 and C9285. E1 to E3, exons 1 to 3;
E, Glu; Q, Gln; G, Gly; R, Arg; A.A.,
amino acid. (C) Enzymatic activities
of recombinant SD1 proteins for 10-min
reactions against GA53 or GA12 (mean ± SD,
n = 3). MBP, maltose-binding protein.
*P < 0.05 (Student's t test). (D and
E) Effects of different active gibberellin
species (10° M) on internode elongation
(means ± SD, n = 9). Uni, 10°° M
uniconazole; arrowheads, positions of
nodes. Different letters denote significant
differences according to the Tukey-Kramer
test (P < 0.05).


activities of recombinant SD1^T65 and SD1^C9285 pro-
teins in the GA1 and GA4 pathways. SD1^C9285
showed higher enzymatic activity than SD1^T65 in
both the GA1 (13) and GA4 pathways (Fig. 3C, fig.
S29, and table S5). In the GA4 pathway, SD1^C9285
displayed enzymatic activity ~270 times as high
as that of SD1^T65 and ~19 times as high as that in
the GA1 pathway. Moreover, near-constitutive
expression of SD1^C9285 in planta increased GA9 and
GA4 levels (fig. S30). Further phenotypic analyses
revealed that the exonic SNPs and the DWH
together contribute to internode elongation in
response to submergence (fig. S31). Taken togeth-
er, our results indicate that the main activity of the
SD1^C9285 protein is the conversion of GA12 into GA9
in the GA4 pathway rather than of GA53 into GA20
in the GA1 pathway and that SD1^C9285 contributes
to the increase of the GA4 level in response to
submergence. The reason GA1, in addition to GA4,
increases in response to submergence in C9285
can be explained by the existence of GA13-oxidase
activity in rice internodes (14), involving a meta-
bolic flow from GA12 to GA53 (Fig. 3A). We further
compared physiological activities of gibberellin
species for elongation of internodes in T65 and
C9285 (Fig. 3, D and E). C9285 showed internode
elongation in response to gibberellins, whereas
T65 did not (15) (Fig. 3, D and E). GA4 promoted
internode elongation 10 times as much as GA1 did,
resulting in the highest increase of total plant
height among the gibberellin species (Fig. 3, D
and E, and fig. S32). The higher bioactivity of GA4
than of GA1, due to differences in binding affinity
for the rice gibberellin receptor (16), may amplify
the effect of gibberellins on cell proliferation and
the elongation of the internodes in deepwater
rice (1). Taken together, our results suggest that
the physiological activity of GA4 impacts internode
elongation and that the increase in GA4 levels
under submergence induces rapid internode
elongation in deepwater rice.

We next investigated the genetic relationships 
among deepwater rices. Haplotype analysis of 
149 O. sativa varieties (17) revealed that the 
DWH is present only in Bangladeshi deepwater 
rice (tables S6A and S7A). Principal components 
analysis showed that DWH-positive [DWH(+)] 
varieties with the most accentuated deepwater 
responses (table S6A) belonged to indica, aus, 
and admixed subpopulations (Fig. 4A and figs. 
S33 to S35). Other DWH(+) varieties all clustered
within indica (figs. S33B and S34B); nucleotide
diversity analysis suggested that indica DWH(+)
varieties underwent a population bottleneck via
selection for cultivation in Bangladesh (fig. S36).
Admixture analysis revealed that the DWH orig- 
inated in indica or aus (Fig. 4B). We further in- 
vestigated DWH presence in O. sativa’s wild 
ancestor, Oryza rufipogon. Gene haplotype network 
analysis showed that wild and cultivated rice 
shared the DWH (Fig. 4C and tables S6B and 
S7B). We analyzed wild rice resequencing (fig. 
S37 and tables S6B and S7B) and indel (table S8) 
data revealing 16 DWH(+) wild accessions (table 
S9A). W1 and W4 subpopulations in wild rice 
have been reported to be likely ancestral pop- 
ulations for indica and aus, respectively (18). 
All but one DWH(+) wild accession were clas- 
sified as W1, W4, or W1 admixed (table S9A). Of 
nine known chloroplast haplotypes (18), cpGroup-
III had the highest DWH frequency (table S9B).
These cpGroup-III DWH(+) accessions originated 
from Bangladesh (fig. S38A and table S8), sug- 
gesting that the DWH emerged in cpGroup-III 
Bangladeshi wild rice. DWH presence was more 
broadly distributed in wild rice than in cultivated 


Fig. 4. The DWH was derived from
standing variation in O. rufipogon
and selected in O. sativa. (A) Distance
tree of diverse O. sativa varieties in
the GWAS panel. Black dots, DWH(+)
varieties. Arrows, varieties with
accentuated deepwater response.
(B) Admixture analysis of the 2-Mb
region flanking SD1 in DWH(+) varieties.
(C) Haplotype network of the SD1
gene. Each haplotype is separated
by mutational changes, with hatches
indicating mutational differences between
linked haplotypes. aro, aromatic; tej,
temperate japonica; trj, tropical japonica;
adx, admix; wild, O. rufipogon. (D and
E) Geographical distribution of the
DWH in O. rufipogon and O. sativa.
The pie chart size is proportional to
the number of accessions.


rice (Fig. 4, D and E); nonetheless, occurrence 
was more frequent in Bangladesh (table S9C). 
Comparison of nucleotide diversity of the SD1
region in DWH(+) accessions versus DWH(-)
accessions from the O. rufipogon panel (fig.
S38A) resulted in a ratio of ~1 (fig. S38B), pro-
viding evidence that the DWH evolved first in
O. rufipogon populations. Three W4 Bangladeshi
wild accessions clustering in the same clade (fig.
S38A, red circle) shared all 48 polymorphisms in
SD1 with C9285, suggesting that this group may
be direct ancestors of Bangladeshi DWH(+) cul- 
tivated deepwater rice (table S10). The absence of 
DWH(+)-diagnostic indels in other wild rice spe- 
cies indicated that the DWH emerged after the 
speciation of O. rufipogon (table S11). Taken to- 
gether, our results support the hypothesis that 
DWH emerged in O. rufipogon during W1-W4 
differentiation; this conditionally functional hap- 
lotype was then a target of selection for the culti- 
vation of O. sativa under deepwater environments 
in Bangladesh. 

On the basis of our present study, we suggest a 
model of signaling pathways underlying rice inter- 
node elongation in response to submergence via a 
direct molecular link between ethylene signaling
and gibberellin biosynthesis, an ethylene-gibberellin
relay (fig. S39). We suggest that the DWH medi-
ates amplification of SD1 transactivation via direct
binding of OsEIL1a, resulting in increased GA1
and GA4; this amplification is key to allowing
deepwater rice to withstand severe flooding con- 
ditions, in combination with SK1/2. Finally, we 
propose a model of evolution and domestication 
for nondeepwater and deepwater rice whereby 
the DWH emerged in O. rufipogon and was se- 
lected for deepwater rice cultivation in Bangladesh 
(fig. S40). 

As contemporary climate change triggers radi- 
cal shifts in weather patterns, cryptic genetic 
variation found in wild rice gene pools may offer 
adaptive solutions to help breeders fine-tune 
modern rice varieties. Here we reveal that a 
transcriptional gain-of-function allele of the 
Green Revolution semidwarf gene triggers rapid 
stem elongation in deepwater rice, enabling it to 
survive adverse flooding conditions. Thus, the 
same gene has been co-opted several times to 
permit rice cultivation in highly contrasting pro- 
duction systems via different molecular responses— 
decreasing enzymatic activity in one case and 
enhancing transactivation in the other. The ca-
pacity of SD1 to function in such diverse roles
in cultivated rice highlights the intrinsic com-
plexity and molecular plasticity of plant adap-
tation strategies.
Assessment of methane emissions
from the U.S. oil and gas supply chain 


Ramon A. Alvarez, Daniel Zavala-Araiza, David R. Lyon, David T. Allen,
Zachary R. Barkley, Adam R. Brandt, Kenneth J. Davis, Scott C. Herndon,
Daniel J. Jacob, Anna Karion, Eric A. Kort, Brian K. Lamb, Thomas Lauvaux,
Joannes D. Maasakkers, Anthony J. Marchese, Mark Omara, Stephen W. Pacala,
Jeff Peischl, Allen L. Robinson, Paul B. Shepson, Colm Sweeney,
Amy Townsend-Small, Steven C. Wofsy, Steven P. Hamburg


Methane emissions from the U.S. oil and natural gas supply chain were estimated by 
using ground-based, facility-scale measurements and validated with aircraft observations 
in areas accounting for ~30% of U.S. gas production. When scaled up nationally, our 
facility-based estimate of 2015 supply chain emissions is 13 ± 2 teragrams per year,
equivalent to 2.3% of gross U.S. gas production. This value is ~60% higher than the U.S. 
Environmental Protection Agency inventory estimate, likely because existing inventory 
methods miss emissions released during abnormal operating conditions. Methane 
emissions of this magnitude, per unit of natural gas consumed, produce radiative forcing 
over a 20-year time horizon comparable to the CO2 from natural gas combustion.
Substantial emission reductions are feasible through rapid detection of the root causes 
of high emissions and deployment of less failure-prone systems. 


Methane (CH4) is a potent greenhouse gas,
and CH4 emissions from human activities
since preindustrial times are responsi-
ble for 0.97 W m^-2 of radiative forcing,
as compared to 1.7 W m^-2 for carbon
dioxide (CO2) (1). CH4 is removed from the at-
mosphere much more rapidly than CO2; thus,
reducing CH4 emissions can effectively reduce
the near-term rate of warming (2). Sharp growth
in U.S. oil and natural gas (O/NG) production
beginning around 2005 (3) raised concerns about
the climate impacts of increased natural gas use
(4, 5). By 2012, disagreement among published
estimates of CH4 emissions from U.S. natural
gas operations led to a broad consensus that
additional data were needed to better charac-
terize emission rates (4-7). A large body of field
measurements made between 2012 and 2016
(table S1) has markedly improved understanding
of the sources and magnitude of CH4 emissions
from the industry’s operations. Brandt et al. sum-
marized the early literature (8); other assessments
incorporated elements of recent data (9-11). This
work synthesizes recent studies to provide an
improved overall assessment of emissions from
the O/NG supply chain, which we define to in-
clude all operations associated with O/NG pro-
duction, processing, and transport (materials and
methods, section S1.0) (12).

Measurements of O/NG CH4 emissions can
be classified as either top-down (TD) or bottom- 
up (BU). TD studies quantify ambient methane 
enhancements using aircraft, satellites, or tower 
networks and infer aggregate emissions from all 
contributing sources across large geographies. 
TD estimates for nine O/NG production areas 
have been reported to date (table S2). These 
areas are distributed across the U.S. (fig. S1) 
and account for ~33% of natural gas, ~24% of oil 
production, and ~14% of all wells (13). Areas 
sampled in TD studies also span the range of 
hydrocarbon characteristics (predominantly gas, 
predominantly oil, or mixed), as well as a range of 
production characteristics such as well produc- 
tivity and maturity. In contrast, BU studies gener- 
ate regional, state, or national emission estimates 
by aggregating and extrapolating measured emis- 
sions from individual pieces of equipment, oper- 
ations, or facilities, using measurements made 
directly at the emission point or, in the case of 
facilities, directly downwind. 

Recent BU studies have been performed on 
equipment or facilities that are expected to rep- 
resent the vast majority of emissions from the 
O/NG supply chain (table S1). In this work, we 
integrate the results of recent facility-scale BU 
studies to estimate CH4 emissions from the U.S.
O/NG supply chain, and then we validate the 
results using TD studies (materials and meth- 
ods). The probability distributions of our BU 
methodology are based on observed facility- 
level emissions, in contrast to the component- 
by-component approach used for conventional 
inventories. We thus capture enhancements pro-
duced by all sources within a facility, including
the heavy tail of the distribution. When the BU
estimate is developed in this manner, direct
comparison of BU and TD estimates of CH4
emissions in the nine basins for which TD 
measurements have been reported indicates 
agreement between methods, within estimated 
uncertainty ranges (Fig. 1). 

Our national BU estimate of total CH4 emis-
sions in 2015 from the U.S. O/NG supply chain
is 13 (+2.1/-1.6, 95% confidence interval) Tg
CH4/year (Table 1). This estimate of O/NG CH4
emissions can also be expressed as a production-
normalized emission rate of 2.3% (+0.4%/-0.3%)
by normalizing by annual gross natural gas pro-
duction [33 trillion cubic feet (13), with average
CH4 content of 90 volume %]. Roughly 85% of
national BU emissions are from production, 
gathering, and processing sources, which are 
concentrated in active O/NG production areas. 
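
The production-normalized rate can be checked with a back-of-envelope conversion from gas volume to methane mass. The sketch below is illustrative only: the methane density of 0.678 kg per cubic meter (roughly 15 °C and 1 atm) and the function name are assumptions that do not come from the text, and the result shifts slightly with the reference conditions chosen.

```python
# Illustrative check of the production-normalized emission rate.
# Assumptions (not from the source): methane density at about
# 15 degrees C and 1 atm, and the helper name below.
CUBIC_FEET_TO_M3 = 0.0283168       # cubic meters per cubic foot
CH4_DENSITY_KG_PER_M3 = 0.678      # assumed reference-condition density


def production_normalized_rate(emissions_tg, gross_gas_tcf, ch4_volume_fraction):
    """Return emitted CH4 as a fraction of the CH4 contained in gross production."""
    gas_m3 = gross_gas_tcf * 1e12 * CUBIC_FEET_TO_M3
    ch4_kg = gas_m3 * ch4_volume_fraction * CH4_DENSITY_KG_PER_M3
    ch4_tg = ch4_kg / 1e9          # 1 Tg = 1e9 kg
    return emissions_tg / ch4_tg


# Values quoted in the text: 13 Tg CH4/year, 33 trillion cubic feet, 90 volume %.
rate = production_normalized_rate(13.0, 33.0, 0.90)
print(f"{rate:.1%}")               # prints 2.3%
```

Under these assumptions, gross production contains roughly 570 Tg of CH4, so 13 Tg of emissions corresponds to about 2.3%.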

Our assessment does not update emissions 
from local distribution and end use of natural 
gas, owing to insufficient information address- 
ing this portion of the supply chain. However, 
recent studies suggest that local distribution 
emissions exceed the current inventory estimate 
(14-16), and that end-user emissions might also 
be important. If these findings prove to be repre- 
sentative, overall emissions from the natural gas 
supply chain would increase relative to the value 
in Table 1 (materials and methods, section S1.5). 

Our BU method and TD measurements yield 
similar estimates of U.S. O/NG CH4 emissions
in 2015, and both are significantly higher than 
the corresponding estimate in the U.S. Environ- 
mental Protection Agency’s Greenhouse Gas 
Inventory (EPA GHGI) (Table 1 and materials 
and methods, section S1.3) (17). Discrepancies 
between TD estimates and the EPA GHGI have 
been reported previously (8, 18). Our BU esti- 
mate is 63% higher than the EPA GHGI, largely 
due to a more than twofold difference in the 
production segment (Table 1). The discrepancy 
in production sector emissions alone is ~4 Tg 
CH4/year, an amount larger than the emissions
from any other O/NG supply chain segment. 
Such a large difference cannot be attributed to 
expected uncertainty in either estimate: The 
extremal ends of the 95% confidence intervals 
for each estimate differ by 20% (i.e., ~12 Tg/year 
for the lower bound of our BU estimate can be 
compared to ~10 Tg/year for the upper bound 
of the EPA GHGI estimate). 

We believe the reason for such large divergence 
is that sampling methods underlying conven- 
tional inventories systematically underestimate 
total emissions because they miss high emis- 
sions caused by abnormal operating conditions 
(e.g., malfunctions). Distributions of measured 
emissions from production sites in BU studies 
are invariably “tail-heavy,” with large emission 
rates measured at a small subset of sites at any 
single point in time (19-22). Consequently, the 
most likely hypothesis for the difference be- 
tween the EPA GHGI and BU estimates derived 
from facility-level measurements is that measure- 
ments used to develop GHGI emission factors 




undersample abnormal operating conditions 
encountered during the BU work. Component- 
based inventory estimates like the GHGI have 
been shown to underestimate facility-level emis- 
sions (23), probably because of the technical 
difficulty and safety and liability risks asso- 
ciated with measuring large emissions from, for 
example, venting tanks such as those observed 
in aerial surveys (24). 
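
One way to see why a tail-heavy distribution makes limited sampling read low is a small simulation. The sketch below is not the analysis used in this work: it assumes, purely for illustration, that site emissions follow a lognormal distribution with arbitrary parameters, and it asks how often the mean of a small survey falls below the true population mean.

```python
import math
import random
import statistics


def undersampling_demo(n_sites=20, n_surveys=2000, mu=0.0, sigma=2.0, seed=1):
    """Simulate repeated small surveys of a heavy-tailed emitter population.

    The lognormal shape and its parameters are illustrative assumptions.
    Returns the true mean, the median survey mean, and the share of
    surveys whose mean falls below the true mean.
    """
    rng = random.Random(seed)
    true_mean = math.exp(mu + sigma ** 2 / 2)   # analytic lognormal mean
    survey_means = []
    for _ in range(n_surveys):
        sample = [rng.lognormvariate(mu, sigma) for _ in range(n_sites)]
        survey_means.append(statistics.fmean(sample))
    share_low = sum(m < true_mean for m in survey_means) / n_surveys
    return true_mean, statistics.median(survey_means), share_low


true_mean, typical_estimate, share_low = undersampling_demo()
print(f"true mean {true_mean:.2f}, typical survey mean {typical_estimate:.2f}")
print(f"share of surveys below the true mean: {share_low:.0%}")
```

Because most of the total comes from a few rare, very large emitters, most small surveys miss them and return a mean below the true value. A sampling scheme that excludes abnormal conditions outright would bias the estimate low even more consistently.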

Abnormal conditions causing high CH4 emis-
sions have been observed in studies across the
O/NG supply chain. An analysis of site-scale emis- 
sion measurements in the Barnett Shale con- 
cluded that equipment behaving as designed 
could not explain the number of high-emitting 
production sites in the region (23). An extensive 
aerial infrared camera survey of ~8000 pro- 
duction sites in seven U.S. O/NG basins found 
that ~4% of surveyed sites had one or more 
observable high-emission rate plumes (24) (de-
tection threshold of ~3 to 10 kg CH4/hour was
two to seven times higher than mean produc- 
tion site emissions estimated in this work). Emis- 
sions released from liquid storage tank hatches 
and vents represented 90% of these sightings. 
It appears that abnormal operating conditions 
must be largely responsible, because the obser- 
vation frequency was too high to be attributed 
to routine operations like condensate flashing 
or liquid unloadings alone (24). All other ob- 
servations were due to anomalous venting from 
dehydrators, separators, and flares. Notably, the 
two largest sources of aggregate emissions in the 
EPA GHGI—pneumatic controllers and equip- 
ment leaks—were never observed from these 
aerial surveys. Similarly, a national survey of 
gathering facilities found that emission rates 
were four times higher at the 20% of facilities 
where substantial tank venting emissions were 
observed, as compared to the 80% of facilities 
without such venting (25). In addition, very large 
emissions from leaking isolation valves at trans- 
mission and storage facilities were quantified by 
means of downwind measurement but could not 
be accurately (or safely) measured by on-site 
methods (26). There is an urgent need to com- 
plete equipment-based measurement campaigns 
that capture these large-emission events, so that 
their causes are better understood. 
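
The gathering-facility comparison above implies a simple weighting. As a hedged worked example, if "four times higher" is read as a fourfold mean emission rate at the 20% of facilities with substantial tank venting (an interpretation, not a figure stated in the text), the share of total emissions from that group follows directly. The function name is hypothetical.

```python
def emission_share_of_high_group(high_fraction, rate_ratio):
    """Share of total emissions from the high-emitting group.

    high_fraction: fraction of facilities in the high-emitting group.
    rate_ratio: mean rate of that group relative to the remaining
    facilities (assumed interpretation of "four times higher").
    """
    high = high_fraction * rate_ratio
    low = (1.0 - high_fraction) * 1.0
    return high / (high + low)


share = emission_share_of_high_group(0.20, 4.0)
print(f"{share:.0%}")
```

Under that reading, one-fifth of the facilities would account for about half of the total, which is the pattern the text describes as tail-heavy.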

In contrast to abnormal operational condi- 
tions, alternative explanations such as outdated 
component emission factors are unlikely to ex- 
plain the magnitude of the difference between 
our facility-based BU estimate and the GHGI. 
First, an equipment-level inventory analogous 
to the EPA GHGI but updated with recent di- 
rect measurements of component emissions (ma- 
terials and methods, section S1.4) predicts total 
production emissions that are within ~10% of 
the EPA GHGI, although the contributions of 
individual source categories differ significant- 
ly (table S3). Second, we consider unlikely an 
alternative hypothesis that systematically higher 
emissions during daytime sampling cause a 
high bias in TD methods (materials and meth-
ods, section S1.6).

Table 1. Summary of this work’s bottom-up estimates of CH₄ emissions from the U.S. oil and
natural gas (O/NG) supply chain (95% confidence interval) and comparison to the EPA
Greenhouse Gas Inventory (GHGI). Values are 2015 CH₄ emissions (Tg/year).

Industry segment      This work (bottom-up)     EPA GHGI (17)
Production            7.6 (+1.9/-1.6)           3.5
[...]                 0.72 (+0.20/-0.071)       [...]
[...]                 0.44 (+0.51/-0.22)        [...]
U.S. O/NG total       13 (+2.1/-1.7)            8.1 (+2.1/-1.4)†

*This work's emission estimates for these sources are taken directly from the GHGI. The local distribution
estimate is expected to be a lower bound on actual emissions and does not include losses downstream of
customer meters due to leaks or incomplete combustion (materials and methods, section S1.5).
†The GHGI only reports industry-wide uncertainties.

[Figure 1 graphic. (A) Basins in order of gas production: Haynesville (7.7 bcf/d), Barnett (5.9 bcf/d), Northeast PA (5.8 bcf/d), San Juan (2.8 bcf/d), Fayetteville (2.5 bcf/d), Bakken (1.9 bcf/d), Uinta (1.2 bcf/d), Weld County (1.0 bcf/d), West Arkoma (0.37 bcf/d), and the 9-basin sum; x axis: (TD-BU)/TD, -100% to 150%. (B) Bottom-up and top-down probability densities; x axis: 9-basin sum, O/NG emissions (Mg CH₄/h), 0 to 600.]

Fig. 1. Comparison of this work’s bottom-up (BU) estimates of methane emissions from oil
and natural gas (O/NG) sources to top-down (TD) estimates in nine U.S. O/NG production areas.
(A) Relative differences of the TD and BU mean emissions, normalized by the TD value, rank ordered
by natural gas production in billion cubic feet per day (bcf/d, where 1 bcf = 2.8 × 10⁷ m³). Error
bars represent 95% confidence intervals. (B) Distributions of the nine-basin sum of TD and BU mean
estimates (blue and orange probability density, respectively). Neither the ensemble of TD-BU pairs
(A) nor the nine-basin sum of means (B) are statistically different [p = 0.13 by a randomization test,
and mean difference of 11% (95% confidence interval of -17 to 41%)].

Two other factors may lead
to low bias in EPA GHGI and similar inventory
estimates. Operator cooperation is required to
obtain site access for emission measurements 
(8). Operators with lower-emitting sites are plau- 
sibly more likely to cooperate in such studies, 
and workers are likely to be more careful to 
avoid errors or fix problems when measure- 
ment teams are on site or about to arrive. The 
potential bias due to this “opt-in” study design 
is very challenging to determine. We therefore 
rely primarily on site-level, downwind mea- 
surement methods with limited or no opera- 
tor forewarning to construct our BU estimate. 
Another possible source of bias is measurement 
error. It has been suggested that malfunction of 
a measurement instrument widely used in the 
O/NG industry contributes to underestimated 
emissions in inventories (27); however, this can- 
not explain the more than twofold difference in 
production emissions (28). 
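
The size of the gap discussed here can be checked against the central estimates in Table 1. The sketch below is illustrative only: it uses the two rows whose labels are legible in Table 1, reads the GHGI production value as 3.5 Tg/year, and ignores the confidence intervals. The helper name `ratio_and_excess` is hypothetical.

```python
# Central estimates from Table 1 (Tg CH4/year): (this work, EPA GHGI).
# Confidence intervals are deliberately ignored in this sketch.
TABLE_1 = {
    "Production": (7.6, 3.5),
    "U.S. O/NG total": (13.0, 8.1),
}


def ratio_and_excess(bottom_up: float, ghgi: float) -> tuple[float, float]:
    """Return (ratio of bottom-up to GHGI, percent by which bottom-up exceeds GHGI)."""
    ratio = bottom_up / ghgi
    return ratio, (ratio - 1.0) * 100.0


for segment, (bu, ghgi) in TABLE_1.items():
    ratio, excess = ratio_and_excess(bu, ghgi)
    print(f"{segment}: {ratio:.1f}x the GHGI value ({excess:.0f}% higher)")
```

The production ratio comes out above 2, consistent with the "more than twofold difference" noted above, whereas the supply-chain total exceeds the GHGI by a smaller factor of roughly 1.6.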


The tail-heavy distribution for many O/NG
CH₄ emission sources has important implica-
tions for mitigation because it suggests that
most sources—whether they represent whole 
facilities or individual pieces of equipment— 
can have lower emissions when they operate as 
designed. We anticipate that significant emis- 
sions reductions could be achieved by deploying 
well-designed emission detection and repair sys- 
tems that are capable of identifying abnormally 
operating facilities or equipment. For example, 
pneumatic controllers and equipment leaks are 
the largest emission sources in the O/NG pro- 
duction segment exclusive of missing emission 
sources (38 and 21%, respectively; table S3), with 
malfunctioning controllers contributing 66% of 
total pneumatic controller emissions (materials 
and methods, section S1.4) and equipment leaks
60% higher than the GHGI estimate.

Gathering operations, which transport unpro-
cessed natural gas from production sites to pro-
cessing plants or transmission pipelines, produce
~20% of total O/NG supply chain CH₄ emissions.
Until the publication of recent measurements 
(29), these emissions were largely unaccounted
for by the EPA GHGI. Gas processing, transmission
and storage together contribute another ~20% 
of total O/NG supply chain emissions, most of 
which come from ~2500 processing and com- 
pression facilities. 

Our estimate of emissions from the U.S. O/NG
supply chain (13 Tg CH₄/year) compares to the
EPA estimate of 18 Tg CH₄/year for all other
anthropogenic CH₄ sources (17). Natural gas
losses are a waste of a limited natural resource
(~$2 billion/year), increase global levels of sur-
face ozone pollution (30), and substantially erode
the potential climate benefits of natural gas use.
Indeed, our estimate of CH₄ emissions across
the supply chain, per unit of gas consumed, re-
sults in roughly the same radiative forcing as
does the CO₂ from combustion of natural gas
over a 20-year time horizon (31% over 100 years).
Moreover, the climate impact of 13 Tg CH₄/year
over a 20-year time horizon roughly equals that
from the annual CO₂ emissions from all U.S. coal-
fired power plants operating in 2015 (31% of the
impact over a 100-year time horizon) (materials 
and methods, section S1.7). 
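
To give a sense of scale for the time-horizon dependence, the sketch below converts 13 Tg CH₄/year into CO₂-equivalent mass. It is an assumption-laden illustration, not the method of section S1.7: it uses the IPCC Fifth Assessment Report global warming potentials for methane without climate-carbon feedbacks (84 over 20 years, 28 over 100 years), and the names `GWP_CH4` and `co2_equivalent` are hypothetical.

```python
# Assumed global warming potentials for CH4 (IPCC AR5, without
# climate-carbon feedbacks). The paper's own method may differ.
GWP_CH4 = {20: 84.0, 100: 28.0}


def co2_equivalent(tg_ch4: float, horizon_years: int) -> float:
    """Convert a CH4 mass (Tg) to CO2-equivalent mass (Tg) for a given horizon."""
    return tg_ch4 * GWP_CH4[horizon_years]


co2e_20 = co2_equivalent(13.0, 20)
co2e_100 = co2_equivalent(13.0, 100)
print(f"20-year horizon:  {co2e_20:.0f} Tg CO2e/year")
print(f"100-year horizon: {co2e_100:.0f} Tg CO2e/year")
print(f"100-year impact as a share of 20-year impact: {co2e_100 / co2e_20:.0%}")
```

With these assumed factors the 100-year figure is about one-third of the 20-year figure, close to but not identical with the 31% stated above, which suggests that the underlying calculation uses a somewhat different metric.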

We suggest that inventory methods would be 
improved by including the substantial volume 
of missing O/NG CH₄ emissions evident from
the large body of scientific work now available 
and synthesized here. Such empirical adjustments 
based on observed data have been previously used 
in air quality management (37). 

The large spatial and temporal variability in 
CH₄ emissions for similar equipment and fa-
cilities (due to equipment malfunction and other
abnormal operating conditions) reinforces the 
conclusion that substantial emission reductions 
are feasible. Key aspects of effective mitigation 
include pairing well-established technologies 
and best practices for routine emission sources 
with economically viable systems to rapidly de- 
tect the root causes of high emissions arising 
from abnormal conditions. The latter could in- 
volve combinations of current technologies such 
as on-site leak surveys by company personnel 
using optical gas imaging (32), deployment of 
passive sensors at individual facilities (33, 34) 
or mounted on ground-based work trucks (35), 
and in situ remote-sensing approaches using
tower networks, aircraft, or satellites (36). Over
time, the development of less failure-prone sys- 
tems would be expected through repeated ob- 
servation of and further research into common 
causes of abnormal emissions, followed by re- 
engineered design of individual components 
and processes.

EARLY DEVELOPMENT


Dual-spindle formation in zygotes 
keeps parental genomes apart 
in early mammalian embryos 


Judith Reichmann, Bianca Nijmeijer, M. Julius Hossain, Manuel Eguren,
Isabell Schneider, Antonio Z. Politi, M. Julia Roberti, Lars Hufnagel,
Takashi Hiiragi, Jan Ellenberg*


At the beginning of mammalian life, the genetic material from each parent meets when the 
fertilized egg divides. It was previously thought that a single microtubule spindle is 
responsible for spatially combining the two genomes and then segregating them to create 
the two-cell embryo. We used light-sheet microscopy to show that two bipolar spindles 
form in the zygote and then independently congress the maternal and paternal genomes. 
These two spindles aligned their poles before anaphase but kept the parental genomes 
apart during the first cleavage. This spindle assembly mechanism provides a potential 
rationale for erroneous divisions into more than two blastomeric nuclei observed in 
mammalian zygotes and reveals the mechanism behind the observation that parental 
genomes occupy separate nuclear compartments in the two-cell embryo. 


After fertilization, the haploid genomes of egg
and sperm come together to form the ge-
nome of a new diploid organism, a moment
that is of fundamental biological impor-
tance. In mammals, parental chromosomes
meet for the first time upon entry into the first 
zygotic mitosis after nuclear envelope breakdown 
(NEBD). It has been assumed that, as observed 
in oocytes, a single bipolar microtubule system 
would self-assemble around both parental ge- 
nomes in zygotes (1-6). However, owing to the 
extreme light sensitivity of the mammalian em- 
bryo, the details of the dynamic process of zy- 
gotic spindle assembly remain unclear. 

To examine how parental genomes join for the 
first time, we imaged live embryos in which the 
maternal and paternal centromeres were differ- 
entially labeled (7) using our recently developed 
inverted light-sheet microscope, which allows 
fast three-dimensional (3D) imaging of embry- 
onic development owing to its low phototoxicity 
(8). This revealed that the two genomes remain 
spatially separate throughout the first mitosis 
(fig. S1 and movie S1). To understand why the 
genomes are not mixed, we next imaged spindle 
assembly using fluorescently labeled microtubule 
organizing centers (MTOCs) and spindle micro- 
tubules (Fig. 1A and movie S2). Newly nucleated 
microtubules self-organized into two separate 
bipolar spindles after NEBD, attracting a subset 
of the cytoplasmic MTOCs that had accumulated 
around each pronucleus to their poles (Fig. 1, A 
and B). Subsequently, the two spindles aligned 
and came into close apposition to form a com-
pound barrel-shaped system. This structure typ-
ically had two clusters of MTOCs at at least one
of its poles, suggesting that the two spindles were 
aligned closely but not completely merged (Fig. 1, 
A and B, and movie S2). To probe this further, we 
performed 3D immunofluorescence analysis of 
zygotes, visualizing endogenous spindle poles, 
microtubules, kinetochores, and DNA. In early 
and mid pro-metaphase, two separate bipolar 
spindles were formed in in vivo-developed zygotes 
(Fig. 1, C and D). Given the delayed and partial 
association of MTOCs with the microtubule mass, 
we hypothesized that dual-spindle formation 
might be driven by self-assembly of microtubules 
nucleated by chromosomes. To test this, we as- 
sayed microtubule regrowth after washing out 
the microtubule-depolymerizing drug Nocodazole. 
Indeed, a large proportion of microtubules was 
nucleated on chromosomes, particularly at ki- 
netochores (fig. S2), whereas MTOCs became 
associated with microtubules only later. This ob- 
servation prompted us to investigate the organi- 
zation of K-fibers in zygotic pro-metaphase by 
means of high-resolution immunofluorescence 
of zygotes fixed after brief cold treatment in 
order to highlight stable microtubules (Fig. 1D 
and fig. S3). Two bipolar arrays of K-fibers started
assembling in early pro-metaphase and were 
stably organized in mid pro-metaphase, and their 
remnants were clearly recognizable by their slight- 
ly offset centers and split poles even after par- 
allelization in metaphase, during which the two 
spindles were no longer separated by a large gap. 

To characterize the kinetics of zygotic spindle 
assembly in live embryos, we next imaged ma- 
ternal and paternal centromeres in relation to 
growing spindle microtubule tips and observed 
three phases of zygotic spindle assembly (Fig. 2A). 
A transient first phase (~3 min; 10.3 ± 3.5 to 13.4 ±
4 min after NEBD), characterized by the clustering
of growing microtubules around the two pronuclei,
was followed by phase 2 (~16 min; 14.5 ± 4 to 30.7 ±
6.5 min after NEBD), in which individual bipolar
spindles assembled around each parental genome,
and subsequently, phase 3 (~83 min; 46.7 ± 17 to
129.2 ± 16.5 min after NEBD), when the two
spindles aligned and combined into a compound 
barrel-shaped structure. 
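
The approximate phase durations quoted above follow from the mean start and end times. The sketch below reproduces that arithmetic; it uses only the reported means (the standard deviations are ignored), and the names `PHASES` and `phase_durations` are hypothetical.

```python
# Mean start and end times in minutes after NEBD, as reported in the text.
PHASES = {
    "phase 1": (10.3, 13.4),
    "phase 2": (14.5, 30.7),
    "phase 3": (46.7, 129.2),
}


def phase_durations(phases: dict[str, tuple[float, float]]) -> dict[str, float]:
    """Return the duration (mean end minus mean start) of each phase in minutes."""
    return {name: round(end - start, 1) for name, (start, end) in phases.items()}


durations = phase_durations(PHASES)
for name, minutes in durations.items():
    print(f"{name}: {minutes} min")
```

The differences of the means agree with the rounded durations of ~3, ~16, and ~83 min given in the text.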

To test whether the two zygotic spindles are 
functionally independent, we measured the tim- 
ing and direction of maternal and paternal 
chromosome congression (fig. S4, A and B, and
supplementary materials, materials and methods). 
Congression started in pro-metaphase (phase 2), 
while the spindles were clearly separated (fig. S4, 
A and B). Parental chromosome congression was 
not correlated in time until shortly before ana- 
phase, suggesting that they are moved by differ- 
ent microtubule systems (fig. S4C). Furthermore, 
the parental genomes were congressed in differ- 
ent directions along separate spindle axes, as 
evidenced by the large difference between the 
angles of the two forming metaphase plates, 
which became parallel only later during dual 
spindle alignment in phase 3 (Fig. 2, B to D, and 
fig. S4, D and E). Tracking of growing micro- 
tubule tips showed two major directions of 
microtubule flow during phase 2 and one main 
direction during phase 3, as indicated by the 
corresponding kymograph profiles (fig. S5 and 
movies S3 to S8). The independent congression 
frequently led to an offset in the final bioriented 
position of the paternal and maternal metaphase 
plates at the end of phase 3 before and during 
segregation (Fig. 2A, arrows). Thus, each of the 
two spindles around the parental genomes func- 
tions independently for chromosome congression 
and may even function separately in chromo- 
some segregation. 
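
The angle between the two metaphase plates can be illustrated with a small calculation. This is a minimal sketch of one plausible way to compute such an angle, not the measurement procedure used in the supplementary materials: it assumes each plate is summarized by a 3D axis vector, and it treats an axis and its reverse as the same direction, so the result lies between 0° and 90°. The function name `axis_angle_deg` and the example vectors are hypothetical.

```python
import math


def axis_angle_deg(u: tuple[float, float, float],
                   v: tuple[float, float, float]) -> float:
    """Angle in degrees between two undirected 3D axes (0 = parallel, 90 = perpendicular)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    # abs() makes the measure insensitive to the sign of either axis;
    # min() guards against rounding slightly above 1.
    cos_theta = min(1.0, abs(dot) / (norm_u * norm_v))
    return math.degrees(math.acos(cos_theta))


# Hypothetical axes: strongly misaligned plates versus nearly parallel plates.
early_angle = axis_angle_deg((1.0, 0.0, 0.0), (1.0, 1.0, 0.0))
late_angle = axis_angle_deg((1.0, 0.0, 0.0), (-1.0, 0.05, 0.0))
print(f"misaligned: {early_angle:.1f} deg, nearly parallel: {late_angle:.1f} deg")
```

In this representation, independent congression in phase 2 would appear as a large angle, and spindle alignment in phase 3 as an angle approaching zero.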

In in vitro fertilization (IVF) clinics, the zygotic 
division is error prone and often leads to em- 
bryos with blastomeres that contain two nuclei 
(9-13). We hypothesized that failure to align the 
two parental spindles before anaphase could ex- 
plain this enigmatic phenotype. To test this, we 
increased the distance between the two pronuclei 
with transient treatment with Nocodazole, which 
led to a larger gap between the two self-assembling 
spindles (Fig. 3 and movies S9 to S11). Indeed, such 
embryos frequently failed to fully align the pa- 
rental spindles at one or both poles. This did not 
delay anaphase but resulted in chromosome seg-
regation by two spindles in different direc-
tions, leading to two-cell embryos with one or
two binucleated blastomeres (Fig. 3 and movies
S10 and S11). By contrast, embryos that did align
the two spindles parallel to each other before ana- 
phase cleaved into two blastomeres with single 
nuclei (movie S9). Thus, failure to align the two 
zygotic spindles gives rise to multinucleated 
two-cell embryos, phenocopying frequently 
observed errors in human embryonic develop- 
ment in IVF clinics. 

Dual-spindle assembly in the mammalian zy- 
gote would also offer a mechanistic explanation 
for the long-standing observation that the paren- 
tal genomes occupy separate compartments 
inside the nuclei of two- and four-cell embryo
blastomeres (14, 15). If dual-spindle assembly
around two pronuclei was responsible for genome
compartmentalization, parental genomes should
mix in subsequent divisions, in which only one
nucleus is present per cell. Imaging of the meta-
phase plate of live hybrid mouse embryos from
the zygote to the eight-cell stage showed that
parental genomes were separated in zygotes but
rapidly became mixed in the subsequent devel-
opmental stages, as predicted (fig. S6, A to D).
This loss of genome compartmentalization was
also seen in in vivo-developed isogenic embryos
(fig. S6, E to I). Thus, parental genomes are kept
separate by two spindles only during the first
mitosis but then mix during subsequent divisions,
driven by a single common spindle.

Fig. 1. Individual bipolar spindle formation around each pronucleus. (A) Time-lapse imaging of
Mus musculus x Mus musculus (MMU x MMU) zygotes expressing EB3-mCherry (marker for
microtubules; green) and tdEos-Cep192 (marker for MTOCs; magenta). Scale bar, 10 µm. In 10 out of
13 zygotes, both or at least one of the dual-spindle poles remained clearly split after the
spindles had parallelized. (B) Schematic diagram showing progression of dual-spindle formation
based on the data presented in (A) and data shown subsequently in the manuscript. Microtubules,
gray; MTOCs, magenta. (C) Immunofluorescence staining of MMU x MMU zygotes fixed at consecutive
stages of development. Maximum z projections of confocal sections of zygotes at prophase, early
pro-metaphase, late pro-metaphase, and early metaphase. White arrowheads indicate poles.
(D) Immunofluorescence staining of cold-treated MMU x MMU pro-metaphase zygotes. Maximum z
projections of confocal sections. Microtubules, α-Tubulin (green); MTOCs, Pericentrin (magenta);
kinetochores, Crest (white); and DNA, Hoechst (blue). Scale bars, 5 µm in (C) and (D).

If dual-spindle assembly is the mechanism for
parental genome compartmentalization (fig. S7A),
formation of a single spindle around both genomes
in the zygote should already mix them in the first
division. To test this prediction, we redirected
spindle assembly with two small-molecule inhib-
itors of microtubule polymerization (Nocodazole)
and the motor protein Eg5 (Monastrol). Transient
treatment with Monastrol collected both genomes
in a single microtubule aster, and subsequent
depolymerization of microtubules with Nocodazole
followed by regrowth after washout then resulted
in one bipolar spindle around both genomes
(figs. S7B and S8, MoNoc-treated zygotes). Such
MoNoc-treated embryos captured and congressed
chromosomes within a single spindle and showed
a high degree of parental genome mixing in the
first mitotic metaphase (Fig. 4). This was substan-
tially different from untreated or control zygotes
in which the order of drug treatments is reversed
(NocMo-treated zygotes), which maintained dual-
spindle formation and genome separation (Fig. 4
and figs. S7C and S8). Thus, dual-spindle forma-
tion in the zygote is required for parental genome
separation in mammals.

Having this experimental method to induce
mixing of the parental genomes in hand allowed
us to demonstrate that genome separation was
not required for parental genome epigenetic
asymmetry and its resolution (16-19), as proposed
previously (figs. S9 and S10) (15, 20-22).

Here, we showed that two spindles form
around pronuclei in mammalian zygotes, which
individually collect the parental genomes and
then position them next to each other before the
first anaphase by aligning the two spindles par-
allel to each other. Our data explain how parental
genome separation is achieved in mammalian
embryos. To date, the formation of physically
distinct mitotic spindles around the two pro-
nuclei has been thought to be specific to certain
arthropod species (23, 24). Our finding that this
occurs also during mammalian pro-metaphase
suggests that two zygotic spindles might be
characteristic for many species that maintain
separate pronuclei after fertilization. Failure to
align the two spindles produced errors in the
zygotic division that closely resemble clinical
phenotypes of human embryos in IVF procedures,
suggesting that a similar mechanism of dual
zygotic spindle assembly also occurs in humans.
This view is supported by the spatial separation
of parental chromosomes reported in human
zygotes (25) and divisions of human zygotes into
multinucleated blastomeres, as well as by reports
that identified a mix of paternal, maternal, and
diploid cells in eight-cell cattle embryos. A dual
zygotic spindle would provide a mechanistic
basis for this parental genome segregation
(9-13, 26, 27). These severe and relatively frequent
zygotic division errors in human and agricul-
turally used mammals thus find their likely
mechanistic explanation in a failure of the close
alignment of the two zygotic spindles before ana-
phase. If a similar mechanism of microtubule-
driven parental genome separation indeed occurs
in human zygotes, it would be important from an
ethical and legal perspective because “pronuclear
fusion,” a process that strictly speaking does not
occur in mouse zygotes, is used to define the
beginning of embryonic life as protected by
law in several countries (for example, Germany,
§ 8 Abs. 1 Embryonenschutzgesetz).

Fig. 2. Spindle assembly and chromosome dynamics in the zygote.
(A) Time-lapse imaging of Mus musculus x Mus spretus (MMU x MSP)
zygotes expressing EB3-mCherry and fluorescent TALEs to label maternal
and paternal chromosomes through distinction of Major satellites (MajSat)
and Minor satellites (MinSat). Phase 1: Microtubule ball formation around
pronuclei. Phase 2: Bipolarization of maternal and paternal spindle. Phase 3:
Formation of single barrel-shaped spindle. (Top and top middle) 3D-rotated
images of the whole spindle volume. Maternal chromosomes, MajSat
(magenta); paternal chromosomes, MinSat (cyan); microtubules, EB3-mCherry
[...] 2 and single bipolar spindle in phase 3 (CompoundSpd; gray). Offset between
maternal and paternal chromosomes at metaphase and anaphase is
indicated with white arrows. (B) Schematic of measurements on maternal
(MatChr) and paternal chromatin masses (PatChr) in phases 2 and 3.
(C and D) Angle between maternal and paternal chromosome axis over time
for (C) a single embryo and (D) averaged for 12 embryos (mean ± SD) is
shown. Phase 1, blue; phase 2, red; phase 3, green.

Fig. 3. Proximity dependency of bipolar spindle fusion. (A) Time-lapse
imaging of MMU x MMU zygotes expressing H2B-mCherry (chromatin;
magenta) and αTubulin-enhanced green fluorescent protein (EGFP)
(microtubules; green). Shown is spindle morphology from pro-metaphase
to postmitosis in three representative zygotes treated with Nocodazole
for >10 hours. Maximum z projections are of pro-metaphase, metaphase,
anaphase, telophase, and postmitosis. Arrowheads and “PB” indicate
nuclei and polar body, respectively. Scale bar, 10 µm. In the absence of
NEBD as a timing reference, anaphase onset was set at 90 min (average
time from NEBD to anaphase in MMU x MMU zygotes), and the other
times were calculated accordingly. (B) Initial distance of pro-nuclei
[...] (n = 19). Statistics, Student's t test.

Fig. 4. Distribution of parental centromeres in control, NocMo-treated,
and MoNoc-treated zygotes. (A) Differential labeling of maternal (MajSat;
magenta) and paternal (MinSat; cyan) centromeres through distinction
of single-nucleotide polymorphisms by means of fluorescent TALEs.
Mitotic spindle is labeled with EB3-mCherry (white). Representative
Z-projected images of parental chromosome distribution in untreated,
MoNoc, and NocMo MMU x MSP zygotes. Scale bar, 10 µm. (B) Degree
of overlap between 3D convex hulls of parental chromosomes for
untreated (n = 31), MoNoc (n = 16), and NocMo (n = 12) zygotes
and embryos with in silico randomized distribution (n = 40) (fig. S1
and supplementary materials, materials and methods). Statistics,
Student's t test.
Technische Universität Darmstadt is one of Germany’s
leading technical universities. Its main areas of 
research, teaching and transfer are engineering and 
natural sciences, which cooperate closely with selected 
humanities and social sciences. TU Darmstadt with its 
approximately 25,000 students is the only technical 
university in Hessen. It is integrated into the economi- 
cally strong Rhine/Main/Neckar region and has an 
effective international network. 


We are looking for an academically proven leader with 
international experience who will develop the profile of 
TU Darmstadt in a creative, committed and competent 
manner, taking into account the legally stipulated 
autonomy. The notable strengths of TU Darmstadt in 
interdisciplinary cooperation between engineering, nat- 
ural sciences, humanities and social sciences are to be 
expanded and there is to be a particular focus on active- 
ly shaping the excellence and internationalisation strat- 
egy. TU Darmstadt lives a culture of open communica- 
tion and expects the applicant to be able to integrate 
and cooperate both internally and externally. 


In accordance with the requirements of Section 39 Hes- 
sisches Hochschulgesetz [Higher Education Act of the 
State of Hessen], the President is elected by the Univer- 
sity Assembly on the proposal of the University Council. 
Adequate knowledge of German and a sound knowl- 
edge of the German higher education system are 
expected. The remuneration is negotiated with the Hes- 
sen State Ministry for Higher Education, Research and 
the Arts. 


TU Darmstadt expressly welcomes applications from 
women. Persons with a degree of disability of at least 
50 or those of an equal status are given preference if 
they are of the same suitability. 


Applications including substantiating documents must 
be sent to the Chairman of the Selection Committee, 
Professor Ferdi Schüth, TU Darmstadt, Referat
Qualitätsmanagement und Gremien (IB), Karolinen-
platz 5, 64289 Darmstadt or by e-mail to
findungskommission@tu-darmstadt.de. 


Application deadline: September 1, 2018 


UR Medicine Wilmot Cancer Institute


Postdoctoral Fellow / Staff Scientist Positions 
in Cancer Epigenomics 


The newly established Laboratory of Cancer Genomics in the Wilmot Cancer 
Institute at the University of Rochester Medical Center is seeking productive and 
highly motivated individuals interested in cancer epigenetics/genomics. Current 
studies make use of molecular genetic and genomic approaches to understand 
epigenetic control of transcription and its role in phenotypic plasticity, intratumor 
heterogeneity and the emergence of invasive or drug resistant properties. 
Translational epigenomics studies in human cancers are focusing on defining 
epigenomic/genomic signatures associated with environmental exposures and 
patient outcomes with the goal of identifying novel routes for cancer therapy. 


Applicants should hold a PhD in the biomedical sciences, computational biology 
or related discipline with experience in molecular/cancer biology, (epi) genetic/ 
omics, and/or bioinformatics. Wet and/or ‘dry’ lab experience with epigenomics 
technology/data (WGBS, ChIP-seq, ATAC-seq, Pro-seq/RIP-seq), single-cell
approaches, RNA biology, gene editing, or cancer models (3D culture, spheroid/ 
organoid) are highly desirable, but applicants with a strong background 
in a related area, track-record of accomplishment, and desire to work in a 
multidisciplinary team of investigators focused on cancer epigenetics will be 
considered. Computational scientists (MS/PhD) with expertise in bioinformatics, 
systems biology and machine learning approaches are also encouraged to apply. 
Candidates should have excellent scientific writing and oral communication 
skills, as well as the ability to work effectively and collaboratively with others. 
Information about living/working at University of Rochester can be found at 
https://www.rochester.edu/working/hr/relocation/ 


Interested applicants should submit a cover letter, CV and contact information 
for three references to: Ms. Elva Mikk, Wilmot Cancer Institute, University 
of Rochester, 601 Elmwood Avenue, Rochester, NY 14642, Box 704 or e-mail 
at: elva_mikk@urmc.rochester.edu. 


The University of Rochester is an Equal Opportunity Employer and has a 
strong commitment to diversity and actively encourages applications from 
candidates from groups underrepresented in higher education.

SCHOOL OF ENGINEERING
AND TECHNOLOGY 


INDIANA UNIVERSITY-PURDUE UNIVERSITY 


IUPUI 


Indianapolis 


Chair, Department of Biomedical Engineering and Thomas J. Linnemeier Guidant Foundation Chair 


The Department of Biomedical Engineering in the Purdue School of Engineering and Technology at Indiana University-Purdue University Indianapolis 
(IUPUI) is seeking highly qualified individuals to apply for the position of Department Chair and holder of the Thomas J. Linnemeier Guidant Foundation 
Chair in Biomedical Engineering, an endowed, tenured position at the rank of Professor. 


This senior level leadership position within the School is intended for candidates who are dedicated to the advancement of undergraduate and graduate 
education, have a distinguished record of research and professional accomplishment, have strong leadership and management abilities, and a steadfast 
commitment to faculty governance. This is an outstanding opportunity to impact the future direction of the department and its programs with the potential 
to guide the department’s ascension to among the premier departments of its kind in the nation. 


Applicants must have a Ph.D. in engineering or a related discipline and a well-documented research career evidenced by significant publications and 
extramural research funding. The search encourages applications from not only well-established Professors but also from rapidly progressing Associate 
Professors whose work would merit promotion to Professor. The incumbent’s research would have a primary application in an established area of biomedical 
engineering such as biomaterials, biomechanics, or bioinstrumentation. Other areas in BME will be considered. Besides leading and managing the department, 
the successful candidate will be expected to establish a state-of-the-art research program in collaboration with the Indiana University School of Medicine, 
also on the IUPUI campus. Major institutes and centers at IUPUI include the Biomechanics and Biomaterials Research Center, the Indiana Center for 
Musculoskeletal Health, the Simon Cancer Center, the Krannert Institute of Cardiology, and the Stark Neurosciences Institute. A qualified candidate may 
be offered joint appointments in both engineering and medicine. A more complete description of BME at IUPUI is available at www.engr.iupui.edu/bme/ 


The Purdue School of Engineering and Technology at IUPUI has 7 academic departments with an extensive undergraduate and graduate degree portfolio, 
including numerous CAC, EAC, and ETAC of ABET accredited B.S. programs. The School has over 3,500 students, including approximately 500 graduate 
students pursuing either M.S. or Ph.D. programs. More information about the School is available at engr.iupui.edu/ 


The IUPUI campus has approximately 30,000 students and over 200 degree programs with annual research expenditures totaling over $300M. IUPUI has 
received the Higher Education Excellence in Diversity (HEED) Award for five consecutive years. More information about IUPUI is available at www.iupui.edu 


Applications must include a statement of interest, curriculum vitae, and contact information for at least three references. Apply for this position at 
https://indiana.peopleadmin.com/postings/4229. Applications are welcome until the position is filled. 


IUPUI is an Equal Opportunity/Affirmative Action educator and employer and affords reasonable accommodations to persons with disabilities. 
UNIVERSITY OF MINNESOTA 
Driven to Discover™ 


Faculty Position in Cardiac/Skeletal Muscle 
Molecular and Integrative Physiology 


The Department of Integrative Biology and Physiology at the University 
of Minnesota Medical School invites applications for a tenure-track faculty 
position. 


We seek outstanding scientists who will establish a competitive extramurally 
funded research program that focuses on molecular and integrative physiology 
of cardiac and/or skeletal muscle biology. Candidates from all areas of 
cardiac/muscle physiology, from gene to whole organism, are sought; 
candidates with interests in muscle stem cells, metabolism, and aging 
are particularly encouraged to apply. This position is open to candidates 
at all ranks and offers exceptional startup support, a dynamic intellectual 
environment, state-of-the-art facilities, a competitive salary, and quality 
research space within the Department of Integrative Biology and Physiology 
(https://www.physiology.umn.edu). 


Minimum qualifications: Ph.D., M.D., or equivalent in a relevant field 
of study, plus applicable postdoctoral or faculty experience. A creative and 
innovative research program is essential; current funding is not required. To 
apply, please upload a curriculum vitae and concise summary of current and 
planned research in response to job opening ID 324849 or 324853 at 
http://www1.umn.edu/ohr/employment. Please also arrange to have three (3) letters of 
recommendation sent to jotto@umn.edu or Search Committee, Department 
of Integrative Biology and Physiology, Medical School, 6-125 Jackson Hall, 
321 Church Street S.E., Minneapolis, MN 55455. Review of applications 
will begin October 1, 2018 and continue until the position is filled. 


The University of Minnesota provides equal access to and opportunity in 
its programs, facilities, and employment without regard to race, color, 
creed, religion, national origin, gender, age, marital status, disability, public 
assistance status, veteran status, sexual orientation, gender identity, or gender 
expression. To learn more about diversity at the U: http://diversity.umn.edu. 


The University of Exeter Medical 
School is excited to invite applications 
for our two new Mireille Gillings 
Professorial Fellowships in Health 
Innovation, with very competitive 
salaries plus unique programme grants 
of up to £1m and leading to tenured 
positions after five years. We are 
seeking to recruit talented researchers 
who are looking to establish 
themselves as future leaders in their 
respective fields and establish a tenured 
career at Exeter. We are particularly 
interested in individuals with research 
interests across: Neuroimaging, PET 
imaging, Drug discovery, Public health, 
Diabetes, Primary care, Dementia and 
Genomics. 


Current opportunities include two 
Mireille Gillings Professorial Fellowships 
in Health Innovation (5 years), 
leading to tenured positions. A very 
competitive salary package plus unique 
c.£1m Fellowship Fund for the right 
candidates. 


Tailored and personalised support 
will be provided for leadership and 
entrepreneurial activities across areas 
such as Executive Education, Finance, 
Coaching and/or MBA studies, 
depending on individual requirements, 
with the aim of creating health leaders 
who can navigate the complexity 
of partnering medicine and money 
and steer society through the health 
challenges of the future. 


Interested? Full details are available at medicine.exeter.ac.uk/about/joinus 
WORKING LIFE 


By Tracy Evans 


Heed the call to change 


I needed a change. Just a few years earlier, I stood at the edge of the swamp under a massive hollow 
cypress tree reading Winnie-the-Pooh to a group of 5- and 6-year-olds. As a naturalist at a state 
park, my goal was to introduce these children to the nature found in their own backyard. I played 
a tape of owl calls to accompany the story. Just then, an owl swooped down over our heads, star- 
tling and delighting us. “I can’t believe I’m actually getting paid to have this much fun!” I thought 
to myself. But over time, my job changed. Sitting at my desk, staring at environmental impact re- 
ports and grant applications on my computer screen, I began to think, “They cannot pay me enough 
to do this job.” It was another turning point in the winding road that led me, at age 66, to earn a Ph.D. 


My first career had been audiology— 
a pragmatic choice based on job op- 
portunities in a rural area, and one 
that allowed me to work part time 
while tending my growing family. 
But when my children grew up and 
left home, I found myself longing 
to be outside, not in a soundproof 
room in the hospital basement. 

I began attending ornithology classes at the local university, 
hoping that might lead me to new career prospects. I also went 
to see a career counselor, offered as part of a university 
program for nontraditional students. “What gave you pleasure as 
a child?” the counselor asked. I thought for a while before 
responding, “Playing outside in the woods.” 

Our discussion eventually led to a second master’s degree, this 
one in zoology, and the job as a naturalist at a state park. It 
was an entry-level position. I was 44 years old. And I was 
delighted to once again experience the joy of being outdoors 
that I had felt as a child. 

But as I climbed the career ladder at my state’s Department of 
Natural Resources, from naturalist to grant compliance officer 
to managing environmental reviews of publicly funded 
conservation projects, my time in the field diminished and 
finally disappeared. I was chained to a computer screen. In the 
back of my mind, the career counselor’s question smoldered. 
“What gave you pleasure as a child?” 

As my discontent grew, life went on. At a dinner party, my 
friends and I discussed the perfect retirement. We agreed 
that, if money were not an issue, we would each purchase 
a home in a different part of the world and rotate among 
them. Shortly thereafter, an opportunity presented itself: 
a 4-month sabbatical at a university in the Netherlands as 
part of a field research project. It wasn’t quite the 
house-share plan, but it had the appeal of being someplace new. 

I found that field research allowed me to play in the woods 
again, but with a new purpose. I was able to combine my 
childhood delight in nature with the intellectual stimulation 
that comes with figuring things out, pleasures that had become 
all too rare in my job. 

At the end of my stay, I was invited to participate in the 
“sandwich” Ph.D. program, in which students do research in 
their home country while receiving support and instruction in 
the Netherlands. I was 60 years old. I didn’t think that 
earning a Ph.D. would further my career. But I wanted to follow 
my new passion. 

I figured that I could work full time at my day job in the 
state government and conduct my research during evenings 
and weekends. Still, my colleagues raised their eyebrows. 
“What do you intend to do with a Ph.D.?” they asked. “It is 
all about the journey,” I responded. 

I successfully defended my dissertation 6 years later, 
3 years after retiring from my government position. 

My career path has taken some sharp zigs and zags over 
my 40 years in the workforce. And each decision I made 
was the right one for me at the time. But when they were no 
longer right, I changed direction. No career decision has to 
be final. As one of my favorite authors, Ursula K. Le Guin, 
wrote, “It is good to have an end to journey towards, but it 
is the journey that matters in the end.” 


Tracy Evans is an associate researcher at the Illinois State 
Museum Research and Collections Center in Springfield. 
Send your career story to SciCareerEditor@aaas.org. 